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ASSAY FOR METHYLATION IN THE GST-Pi GENE 

Field of the Invention: 

This invention relates to an assay for diagnosis or prognosis of a 
disease or condition characterised by abnormal methylation of cytosine at a 
site or sites within the glutathione-S-transferase (GST) Pi gene and/or its 
regulatory flanking sequences. In one particular application, the invention 
provides an assay for the diagnosis or prognosis of prostate cancer. 

Background of the Invention: 

DNA METHYLATION IN MAMMALIAN GENOMES 

The only established post-synthetic modification of DNA in higher 
animal and plant genomes is methylation of the 5' position of cytosine. The 
proportion of cytosines which are methylated can vary from a few percent in 
some animal genomes (1) to 30% in some plant genomes (2). Much of this 
methylation is found at CpG sites where the symmetrically positioned 
cytosines on each strand are methylated. In plant genomes, similar 
symmetrical methylation of cytosines at CpNpG (where N can be any base) is 
also common (3). Such sites of methylation have also been identified at low 
frequency in mammalian DNA (4). 

Methylation patterns are heritable as the methylase enzyme recognises 
as a substrate, sites where a CpG dinucleotide is methylated on one strand 
but the corresponding C on the other strand is unmethylated, and proceeds to 
methylate it (5,6). Fully unmethylated sites do not normally act as 
substrates for the enzyme and hence remain unmethylated through 
successive cell divisions. Thus, in the absence of errors or specific 
intervening events, the methylase enzyme enables the stable heritability of 
methylation patterns. 

Extensive studies of gene expression in vertebrates have shown a 
strong correlation between methylation of regulatory regions of genes and 
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their lack of expression (7). Most of such studies have examined only a 
limited number of restriction enzyme sites using enzymes which fail to cut if 
their target sites are methylated. A far more limited number have been 
examined at all cytosine bases using genomic sequencing methods (8, 9). 
BISULPHITE CONVERSION OF DNA 

Treatment of single-stranded DNA with high concentrations of 
bisulphite followed by alkali leads to the selective deamination of cytosine, 
converting it to uracil (10, 11). By contrast, 5-methyl cytosines (5meC) are 
resistant to this chemical deamination. When bisulphite-treated DNA is 
copied by DNA polymerases, the uracils are read as if they were thymines 
and an adenine nucleotide incorporated, while 5meC is still read as a 
cytosine (a G being incorporated opposite). Thus, after a region of sequence 
is amplified by polymerase chain reaction (PCR), cytosines in the sequence 
which were methylated in the original DNA will be read as cytosines while 
unmethylated cytosines will be read as thymines (12, 13). 
PCR AMPLIFICATION OF METHYLATED AND UNMETHYLATED DNA 

In order to amplify bisulphite-treated DNA, primers are designed to 
anneal to the sequence produced after bisulphite treatment of the DNA. 
Since cytosines are converted to uracils, the base in the annealing primer 
will be an adenine rather than a guanine for the non-converted cytosine. 
Similarly, for the other primer of the pair, thymines replace cytosines. To 
permit quantification of levels of methylation in the target DNA, primers are 
normally chosen to avoid sites which may or may not be methylated 
(particularly CpG sites) and so may contain either a 5meC or a uracil after 
bisulphite treatment. Use of such non-selective primers allows both 
methylated and unmethylated DNAs to be amplified by PCR, providing for 
quantification of the level of methylation in the starting DNA population. 
The PCR-amplified DNA can b,e cut with an informative restriction enzyme, 
can be sequenced directly to provide an average measure of the proportion of 
methylation at any position or molecules may be cloned and sequenced (each 
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clone will be derived from amplification of an individual strand in the initial 
DNA). Such studies have indicated that, while a population of molecules 
may conform to an overall pattern of methylation, not all molecules will be 
identical and methylation may be found on only a fraction of molecules at 
some sites (13, 16). 

SELECTIVE AMPLIFICATION OF METHYLATED DNA 

Recently Herman et al. (14) described a variation of the bisulphite 
sequencing procedure to make it selective for the amplification of only 
methylated DNA. In this work, PCR primers were used which were designed 
to discriminate between the sequences produced after bisulphite-treatment of 
methylated and non-methylated target DNAs. Thus, cytosines which formed 
part of a CpG site would not be bisulphite converted and would remain as 
cytosines in the methylated DNA but would be converted to uracils in the 
unmethylated target DNA. Primers utilising these differences were designed 
and used for the amplification of methylated DNA sequences from four 
tumour suppressor genes, pl6, pl5, E-cadherin and von Hippel-Lindau. 
METHYLATION OF THE GLUTATHIONES-TRANSFERASE Pi GENE IN 
PROSTATE CANCER 

Lee et al. (15) (US Patent No 5,552,277 and International Patent 
Application No PCT/US95/09050) demonstrated that expression of the 
glutathione-S-transferase (GST) Pi gene is lost in nearly all cases of prostate 
cancer. They further showed that in twenty cases examined, using Southern 
blotting, that this loss of expression was accompanied by methylation at a 
specific restriction enzyme site (BssHII) in the promoter region of the gene. 
This methylation was not seen in normal prostate tissue or in a number of 
other normal tissues examined. In examining a prostate cancer cell line in 
which the GST-Pi gene is inactive, they also identified methylation at two 
other restriction enzyme sites, Notl and Sacll in the promoter region of the 
gene. Digestion of cell line DNAs with the enzymes Mspl and Hpall, 
indicated that die correlation of DNA methylation with lack of expression 
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was not maintained for these sites which were largely located downstream of 
the transcription start site. The nature of the data makes it difficult to reach 
conclusions on the methylation status of individual Mspl/Hpall sites. 
However, Lee et al. (18) were able to show that following HpalL digestion 
(which will cut at all unmethylated Hpall sites), a region of DNA containing 
twelve Hpall recognition sites could be amplified by PCR from tumour DNA, 
but not from normal prostate or leukocyte DNA. This indicates that some 
DNA molecules in prostate cancer are methylated at all these Hpall sites, 
while DNAs from normal prostate and leukocyte DNA must contain at least 
one of these sites unmethylated (as a single cut will render the region 
incapable of being amplified by PCR). 

The present inventors have identified and developed an alternative 
method for detecting sites of methylation present in DNA from prostate 
cancer tissue but not present in DNA from normal tissue. The method relies 
on selective amplification of a target region of the GST-Pi gene but does not 
require prior restriction with an informative restriction enzyme. 

Disclosure of the Invention: 

Thus, in a first aspect, the present invention provides a diagnostic or 
prognostic assay for a disease or condition in a subject, said disease or 
condition characterised by abnormal methylation of cytosine at a site or sites 
within the glutathione-S-transferase (GST) Pi gene and/or its regulatory 
flanking sequences, wherein said assay comprises the steps of; 

(i) isolating DNA from said subject, 

(ii) exposing said isolated DNA to reactants and conditions for the 
amplification of a target region of the GST-Pi gene and/or its regulatory 
flanking sequences which includes a site or sites at which abnormal cytosine 
methylation characteristic of the disease or condition occurs, the 
amplification being selective in that it only amplifies the target region if the 
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said site or sites at which abnormal cytosine methylation occurs is/are 
methylated, and 

(iii) determining the presence of amplified DNA. 

Since the amplification is designed to only amplify the target region if 
the said site or sites at which abnormal cytosine methylation (i.e. as 
compared to the corresponding site or sites of DNA from subjects without the 
disease or condition being assayed) occurs is/are methylated, the presence of 
amplified DNA will be indicative of the disease or condition in the subject 
from which the isolated DNA has been obtained. The assay thereby provides 
a means for diagnosing or prognosing the disease or condition in a subject. 

The step of isolating DNA may be conducted in accordance with 
standard protocols. The DNA may be isolated from any suitable body 
sample, such as cells from tissue (fresh or fixed samples), blood (including 
serum and plasma), semen, urine, lymph or bone marrow. For some types of 
body samples, particularly fluid samples such as blood, semen, urine and 
lymph, it may be preferred to firstly subject the sample to a process to enrich 
the concentration of a certain cell type (e.g. prostate cells). One suitable 
process for enrichment involves the separation of required cells through the 
use of cell-specific antibodies coupled to magnetic beads and a magnetic cell 
separation device. 

Prior to the amplifying step, the isolated DNA is preferably treated 
such that unmethylated cytosines are converted to uracil or another 
nucleotide capable of forming a base pair with adenine while methylated 
cytosines are unchanged or are converted to a nucleotide capable of forming 
a base pair with guanine. This treatment permits the design of primers 
which enable the selective amplification of the target region if the said site or 
sites at which abnormal cytosine methylation occurs is/are methylated. 

Preferably, following treatment and amplification of the isolated DNA, 
a test is performed to verify that unmethylated cytosines have been 
efficiently converted to uracil or another nucleotide capable of forming a 
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base pair with adenine, and that methylated cystosines have remained 
unchanged or efficiently converted to another nucleotide capable of forming 
a base pair with guanine. 

Preferably, the treatment of the isolated DNA involves reacting the 
isolated DNA with bisulphite in accordance with standard protocols. As will 
be clear from the above discussion of bisulphite treatment, unmethylated 
cytosines will be converted to uracil whereas methylated cytosines will be 
unchanged. Verification that unmethylated cytosines have been converted to 
uracil and that methylated cystosines have remained unchanged may be 
achieved by; 

(i) restricting an aliquot of the treated and amplified DNA with a suitable 
restriction enzyme(s) which recognise a restriction site(s) generated by or 
resistant to the bisulphite treatment, and 

(ii) assessing the restriction fragment pattern by electrophoresis. 
Alternatively, verification may be achieved by differential hybridisation 
using specific oligonucleotides targeted to regions of the treated DNA where 
unmethylated cytosines would have been converted to uracil and methylated 
cytosines would have remained unchanged. 

The amplifying step may involve polymerase chain reaction (PCR) 
amplification, ligase chain reaction amplification (20) and others (21). 

Preferably, the amplifying step is conducted in accordance with 
standard protocols for PCR amplification, in which case, the reactants will 
typically be suitable primers, dNTPs and a thermostable DNA polymerase, 
and the conditions will be cycles of varying temperatures and durations to 
effect alternating denaturation of strand duplexes, annealing of primers (e.g. 
under high stringency conditions) and subsequent DNA synthesis. 

To achieve selective PCR amplification with bisulphite-treated DNA, 
primers and conditions may be used to discriminate between a target region 
including a site or sites of abnormal cytosine methylation and a target region 
where there is no site or sites of abnormal cytosine methylation. Thus, for 
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amplification only of a target region where the said site or sites at which 
abnormal cytosine methylation occurs is/are methylated, the primers used to 
anneal to the bisulphite-treated DNA (i.e. reverse primers) will include a 
guanine nucleotide(s) at a site(s) at which it will form a base pair with a 
methylated cytosine(s). Such primers will form a mismatch if the target 
region in the isolated DNA has unmethylated cytosine nucleotide(s) (which 
would have been converted to uracil by the bisulphite treatment) at the site 
or sites at which abnormal cytosine methylation occurs. The primers used 
for annealing to the opposite strand (i.e. the forward primers) will include a 
cytosine nucleotide(s) at any site(s) corresponding to site(s) of methylated 
cytosine in the bisulphite-treated DNA. 

Preferably, the primers used for the PCR amplification are of 12 to 30 
nucleotides in length and are designed to anneal to a sequence within the 
target region that includes two to four cytosine nucleotides that are 
abnormally methylated in the DNA of a subject with the disease or condition 
being assayed. In addition, the primers preferably include a terminal 
nucleotide that will form a base pair with a cytosine nucleotide (reverse 
primer), or the guanine nucleotide opposite (forward primer), that is 
abnormally methylated in the DNA of a subject with the disease or condition 
being assayed. 

The step of amplifying is used to amplify a target region within the 
GST-Pi gene and/or its regulatory flanking sequences. The regulatory 
flanking sequences may be regarded as the flanking sequences 5' and 3' of the 
GST-Pi gene which include the elements that regulate, either alone or in 
combination with another like element, expression of the GST-Pi gene. 
Preferably, the regulatory flanking sequences consist of the 400 nucleotide 
sequence immediately 5' of the transcription start site and the 100 nucleotide 
sequence immediately 3' of the transcription stop site. 

More preferably, the step of amplifying is used to amplify a target 
region within the region of the GST-Pi gene and its regulatory flanking 
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sequences defined by (and inclusive of) CpG sites -43 to +55 (wherein the 
numbering of the CpG sites is relative to the transcription start site). The 
numbering and position of CpG sites is shown in Figure 1. 

The step of determining the presence amplified DNA may be 
conducted in accordance with standard protocols. One convenient method 
involves visualisation of a band(s) corresponding to amplified DNA, 
following gel electrophoresis. 

Preferably, the disease or condition to be assayed is selected from 
cancers, especially hormone dependent cancers such as prostate cancer, 
breast cancer and cervical cancer, and liver cancer. 

For the diagnosis or prognosis of prostate cancer, the step of amplifying 
preferably amplifies a target region within the region of the GST-Pi gene and 
its regulatory flanking sequences defined by (and inclusive of) CpG sites -43 
to +53, more preferably, -43 to +10. However, within these target regions it 
is believed that there are CpG sites which show variability in methylation 
status in prostate cancer or are methylated in other tissues. Thus, for the 
target region defined by (and inclusive of) CpG sites -43 to +10, it is 
preferred that the primers used for amplification be designed so as to 
minimise (i.e. by use of redundant primers or by avoidance of the sites) the 
influence of CpG sites -36, -32, -23, -20, -14 and a polymorphic region 
covering site -33. Further, for DNA isolated from cells other than from 
prostate tissue (e.g. blood), it is preferred that the primers used be designated 
to amplify a target region that does not include the region of the GST-Pi gene 
and its regulatory flanking sequences defined by (and inclusive of) CpG sites 
-7 to +7, or, more preferably, -13 to +8, since this may lead to false positives. 
Further preferred target regions, therefore, are within the region of the GST- 
Pi gene and its regulatory flanking sequences defined by (and inclusive of) 
CpG sites -43 to -14, -43 to -8, +9 to +53 and +1 to +53. 
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Suitable primer pairs for the diagnosis or prognosis of prostate cancer, 
include those consisting of a forward and reverse primer selected from each 
of the following groups: 

Forward Primers (i.e. anneal to the 5' end of the target region) 
CGCGAGGTTTTCGTTGGAGTTTCGTCGTC (SEQ ID NO: 1) 

CGTTATTAGTGAGTACGCGCGGTTC (SEQ ID NO: 2) 

YGGTTTTAGGGAATTTTTTTTCGC (SEQ ID NO: 3) 

YGGYGYGTTAGTTYGTTGYGTATATTTC (SEQ ID NO: 4) 

GGGAATTTTTTTTCGGGATGTTTYGGCGC (SEQ ID NO: 5) 

TTTTT AG GGGGTTYGGAGCGTTTC (SEQ ID NO: 6) 

GGTAGGTrGYGTTTATCGC (SEQ ID NO: 7) 

Reverse Primers (i.e. anneal to the extension of the forward primer) 
TCCCATCCCTCCCCGAAACGCTCCG (SEQ ID NO: 8) 

GAAACGCTCCGAACCCCGTAAAAACCGCTAACG (SEQ ID NO: 9) 
CRCCCTAAAATCCCCRAAATCRCCGCG (SEQ ID NO: 10) 

ACCCCRACRACCRCTACACCCCRAACGTCG (SEQ ID NO: 11) 

CTCTTCTAAAAAATCCCRCRAACTCCCGCCG (SEQ ID NO: 12) 
AAAACRCCCTAAAATCCCCGAAATCGCCG (SEQ ID NO: 13) 

AACTCCCRCCGACCCCAACCCCGACGACCG (SEQ ID NO: 14) 

AAAAATTCRAATCTCTCCGAATAAACG (SEQ ID NO: 15) 

AAAAACCRAAATAAAAACCACACGACG (SEQ ID NO: 16) 

wherein Y is C, T or, preferably, a mixture thereof, and R is A, G or, 
preferably, a mixture thereof. 

For the diagnosis or prognosis of liver cancer, the step of amplifying 
preferably amplifies a target region within the region of the GST-Pi gene and 
its regulatory flanking sequences defined by (and inclusive of) CpG sites -43 
to -14 and/or +9 to +53. However, within these target regions it is believed 
that there are CpG sites which show variability in methylation status in liver 
cancer or are methylated in other tissues. Thus, for the target region defined 
by (and inclusive of) CpG sites -43 to -14, it is preferred that the primers used 
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for amplification be designed so as to minimise (i.e. by use of redundant 
primers or by avoidance of the sites) the influence of CpG sites -36, -32, -23, - 
, 20, -14 and a polymorphic region covering site -33. 

It will be appreciated by persons skilled in the art, that a site or sites of 
abnormal cytosine methylation within the above identified target regions of 
the GST-Pi gene and/or its regulatory flanking sequences, could be detected 
for the purposes of diagnosing or prognosing a disease or condition 
(particularly, prostate cancer and/or liver cancer) by methods which do not 
involve selective amplification. For instance, oligonucleotide/polynucleotide 
probes could be designed for use in hybridisation studies (e.g. Southern 
blotting) with bisulphite-treated DNA which, under appropriate conditions of 
stringency, selectively hybridise only to DNA which includes a site or sites of 
abnormal methylation of cytosine(s). Alternatively, an appropriately selected 
informative restriction enzyme(s) could be used to produce restriction 
fragment patterns that distinguish between DNA which does and does not 
include a site or sites of abnormal methylation of cytosine(s). 

Thus, in a second aspect, the present invention provides a diagnostic 
or prognostic assay for a disease or condition in a subject said disease or 
condition characterised by abnormal methylation of cytosine at a site or sites 
within the glutathione-S-transferase (GST) Pi gene and/or its regulatory 
flanking sequences, wherein said assay comprises the steps of; 

(i) isolating DNA from said subject, and 

(ii) determining the presence of abnormal methylation of cytosine at a site 
or sites within the region of the GST-Pi gene and/or its regulatory flanking 
sequences defined by (and inclusive of) CpG sites -43 to +55. 

The step of isolating DNA may be conducted as described above in 
relation to the assay of the first aspect. 

Preferably, the region of the GST-Pi gene and its regulatory flanking 
sequences within which the presence of methylated cytosine(s) at a site or 
sites is determined is selected from the regions defined by (and inclusive of) 
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CpG sites -43 to +53, -43 to +10, -43 to -14, +9 to +53 and +1 to +53. 
However, within these regions, it is preferred that certain sites (namely, CpG 
sites, -36, -33, -32, -23, -20, -17 and -14) be avoided as the site or sites at 
which, for the purpose of the assay, the presence of abnormal methylation of 
cytosine is determined. 

Where the determination step is to involve selective hybridisation of 
oligonucleotide/polynucleotide/peptide-nucleic acid (PNA) probes, prior to 
the determination step, the isolated DNA is preferably treated (e.g. with 
bisulphite) such that unmethylated cytosines are converted to uracil or 
another nucleotide capable of forming a base pair with adenine while 
methylated cytosines are unchanged or are converted to a nucleotide capable 
of forming a base pair with guanine. This treatment permits the design of 
probes which allow for selective hybridisation to a target region including a 
site or sites of abnormal methylation of cytosine. 

In a third aspect, the present invention provides a primer or probe 
(sequence shown in the 5' to 3' direction) comprising a nucleotide sequence 
selected from the group consisting of: 

CGCGAGGTTTTCGTTGGAGTTTCGTCGTC (SEQ ID NO: 1) 

CGTTATTAGTGAGTACGCGCGGTTC (SEQ ID NO: 2) 

YGGTTTTAGGGAATTTTTTTTCGC (SEQ ID NO: 3) 

YGGYGYGTTAGTTYGTTGYGTATATTTC (SEQ ID NO : 4) 

GGGAATTTTTTTTCGCGATGTTTYGGCGC (SEQ ID NO: 5) 

TTTTTAGGGGGTTYGGAGCGTTTC (SEQ ID NO: 6) 

GGTAGGTTGYGTTTATCGC (SEQ ID NO : 7) 

AAAAATTCRAATCTCTCCGAATAAACG (SEQ ID NO: 8) 

AAAAACCRAAATAAAAACCACACGACG (SEQ ID NO: 9) 

TCCCATCCCTCCCCGAAACGCTCCG (SEQ ID NO: 10) 

GAAACGCTCCGAACCCCCTAAAAACCGCTAACG (SEQ ID NO: 11) 
CRCCCTAAAATCCCCRAAATCRCCGCG (SEQ ID NO: 12) 

ACCCCRACRACCRCTACACCCCRAACGTCG (SEQ ID NO: 13) 
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CTCTTCTAAAAAATCCCRCRAACTCCCGCCG (SEQ ID NO: 14) 
AAAACRCCCTAAAATCCCCGAAATCGCCG (SEQ ID NO: 15) 

AACTCCCRCCGACCCCAACCCCGACGACCG, (SEQ ID NO: 16) 

wherein Y is C, T or, preferably, a mixture thereof, and R is A, G or, 

preferably, a mixture thereof. 

The terms "comprise", "comprises" and "comprising" as used 

throughout the specification are intended to refer to the inclusion of a stated 

component, feature or step or group of components, features or steps with or 

without the inclusion of a further component, feature or step or group of 

components, features or steps. 

The invention will now be further described with reference to the 

accompanying figures and following, non-limiting examples. 

Brief description of the accompanying figures: 

Figure 1 shows the organisation and nucleotide sequence of the human 
GST-Pi gene. CpG sites are numbered relative to the transcription start site. 
Nucleotide Sequence numbering is according to the GST-Pi gene sequence of 
Genbank Accession No. M24485. 

Figure 2 shows the region of the GST-Pi gene exhibiting differential 
methylation in prostate cancer. The figure further shows the sequence and 
derivation of primers for the upstream region (from CpG site -43 to +10) and 
the common polymorphism encompassing CpG site -33 (shown above the 
sequence (p)). Underneath the GST-Pi sequence is shown the sequence of 
the derived strand after conversion of cytosines to uracil. The derived strand 
is shown either assuming all CpGs are methylated (B-M) or un-methylated 
(B-U). Below this is shown specific primers designed to selectively amplify 
the methylated sequence. 
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Figure 3 shows the methylation status of each CpG site in isolated 
DNAs; 

A - for the core promoter region through to the 3' end of the GST-Pi gene for 
the LNCaP (LN) cell line, DU145 (DU) cell line, PC3 cell line, PC3-M cell line 
and PC3-MM cell line, for DNA isolated from normal tissue samples from 
prostate cancer patients (2AN, BN and CN), for prostate tumour tissue (BC, 
CC, DC, XC, WC and 2AC) and for normal prostate tissue (Pr) from a person 
without prostate cancer; 

B - for the core promoter region and upstream sequences of the GST-Pi gene 
from normal prostate tissue (from a person without prostate cancer), from 
three prostate cancer samples (BC, CC and DC) and for a number of other 
normal tissues. Patients B and D were polymorphic at CpG site -33 and the 
level of methylation indicated in the brackets reflects methylation of the 
allele which contains the CpG For CpG sites -28 to +10, the level of 
methylation was determined by direct sequence analysis of the population of 
PCR molecules (17). For the upstream CpG sites, -56 to -30, PCR products 
were cloned and a number of individual clones sequenced (number indicated 
in brackets below the sample name). For normal tissues the level of 
methylation at each site was determined as the fraction of all clones 
containing a C at that position. For the cancer samples BC, CC and DC, the 
level of methylation shown is that among the clones which showed DNA 
methylation in the region from CpG site -43 to -30 (about half of the clones in 
each case). 

In both A and B, a blank box indicates that the site was not assayed, and a "B" 
indicates that the status of the site could not be determined (e.g. because of a 
sequence blockage or it was beyond the range of the sequencing run). The 
level of methylation detected at each site is shown, none (-), up to 25% (+), 
26-50% (++), 51-75% ( + + +) and 76-100% (+ + ++). The Gleason Grade of 
tumour samples is also shown. 
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Figure 4 provides the results of amplification of bisulphite treated 
DNAs from a variety of tissues; 

A - panel A (region covering the transcription start site) used CGPS-1 and 3 as 
outer primers and CGPS-2 and 4 as inner primers, Panel B used the outer 
primer pair CGPS-5 and 8 which encompass the region from CpG site -39 to - 
16 for first round amplification, followed by a second round of amplification 
with the CGPS-6 and 7 primers, amplifying a 140 bp fragment covering CpG 
sites -36 to -23. The lanes are 1. Brain, 2. Lung, 3. Skeletal muscle, 4. Spleen, 
5. Pancreas. 6. "Normal" Prostate Aged 85 y.o., 7. "Normal" Prostate Aged 62 
y.o„ 8. Heart, 9. Bone Marrow, 10. Blood-1, 11. Blood-2, 12. Blood-3, 13. 
Liver-1, 14. Liver-2; 

B - used the same primer pairs as that of the amplification shown in Figure 
4A Panel B, with DNA from 10 prostate cancer tissue samples (c) and 
matched normal (n) tissue samples from the same prostates (a positive 
control (+) LNCaP DNA and a negative control (-) is also shown). 
Underneath, is the Gleason grade and the level of methylation of samples 
seen with non-selective primers. 

C - used the same primer pairs as that of the amplification shown in Figure 
4A Panel B, with DNA from a range of healthy tissues, blood from prostate 
cancer patients and various cell lines. The lanes are: Panel A 1-10 blood 
samples from prostate cancer patients during radical prostacectomy; Panel B 
1. normal prostate-1, 2. normal prostate-2, 3. normal prostate-3, 4. normal 
prostate-4, 5. normal prostate-5, 6. HPV transformed prostate cell line, 7. 
blood from prostate patient PA (PSA=1000), 8. blood from prostate patient 
PB (PSA=56), 9. blood from prostate patient PC (PSA=18); and Panel C 1. 
LNCaP cell line, 2. Dul45 cell line, 3. PC-3 cell line, 4. PC-3M cell line, 5. PC- 
3MM cell line, 6. Hela cell line, 7. leukemic DNA, 8. HepG2 cell line, 9. 
human liver DNA, 10. white blood cells, 11. MRC-5 cell line. 

Figure 5 provides the results of amplification of bisulphite treated 
DNAs from seminal fluid of prostate cancer patients (c) and from men with 
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no diagnosed prostate cancer (n), using the outer primer pair CGPS-5 and 8 
and CGPS-6 and 7 as the inner primer pair. The lanes are L. LNCaP cell line 
(positive control), D. DU145 cell line, P. PC-3 cell line (negative controls), 
and M. molecular weight markers. 

Figure 6 shows the results of amplification of bisulphite treated DNAs, 
wherein the DNA has been isolated from prostate tissue slides that had been 
identified as either cancerous or diseased with benign hyperplasia (BPH). 
Selective PCR amplification was conducted using the outer primer pair 
CGPS-5 and 8 and the inner primer pair CGPS-11 and 12. 

Figure 7 shows the results of amplification of bisulphite treated DNAs, 
wherein the DNA has been isolated from prostate cancer cells enriched from 
blood samples using magnetic beads coated with an anti-epithelial antibody. 
Different numbers of LNCaP prostate cancer cells were added to the blood 
samples [7 A) or blood with added LNCaP cells stored for different times at 
4°C or room temperature prior to DNA isolation. 

Figure 8 provides the results of amplification of bisulphite treated 
DNAs, wherein the DNA has been isolated from blood samples from normal 
subjects with no known prostate complaint, from patients with benign 
hyperplasia (BPH) of the prostate and from patients with histologically 
confirmed prostate cancer. 

Figure 9 shows the results of amplification of bisulphite treated DNAs 
isolated from 20 liver cancer tissue samples. Selective PCR amplification 
was conducted using the outer primer pair CGPS-5 and 8 and the inner 
primer pair CGPS-11 and 12. 

Figure 10 shows the results of tests conducted to confirm that any 
amplified DNA products has occurred from amplification of bisulphite 
treated DNA wherein all unmethylated cytosine has been converted to uracil. 
The tests are conducted using oligonucleotides probes designed to hybridise 
to converted or non-converted target regions. 
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GENERAL METHODS AND STRATEGIES 
{ 1) Treatment of DNA witli bisulphite 

DNA for assaying was isolated from suitable sources by standard protocols 
and treated with bisulphite by well known methods (12, 13, 16). 

(2) Characterisation of Meth ylation of Individual Sites in DNA 

In order to determine the methylation status of individual cytosine 
nucleotides in target and non-target DNAs and to identify differences 
between them, bisulphite-modified DNA was amplified by PCR using primers 
designed to minimise the possibility that the methylation status of a 
particular CpG site will influence primer annealing and subsequent 
amplification (12, 13, 16). 

(3) Design of Selective Primers 

Based on the sequencing information, primers for use in the assay were 
designed to maximise the possibility that the methylation status of a 
particular CpG site would influence primer annealing and subsequent 
amplification. Specifically, the design principles followed (described for the 
"forward" PCR primer where the primer contains the same C to T (or U) 
conversions as would occur in the bisulphite-treated DNA), are listed below 
at (a) to (d): 

(a) That primers should cover sequence regions which contain a number 
of C's. Conversion of unmethylated C's to Us provides for discrimination 
between molecules which have undergone efficient bisulphite conversion 
and molecules in which C's have not reacted (e.g. because not completely 
dissolved or containing regions of secondary structure). 

(b) That at least one, but preferably at least two to four, of the C's in the 
regions should be C's (generally at CpG sites) known to be methylated in a 
high proportion of the DNA to be detected (i.e. target DNA). Thus, these C's 
will remain C's in the target DNA while being converted to U's in the non- 
target DNA. A primer which is designed to be the exactly equivalent of the 
bisulphite-converted methylated DNA will contain a mismatch at each of the 
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positions of an unmethylated C which has been converted to a U in an 
unmethylated DNA. The more mismatches that are present, the greater the 
differential hybridisation stability of the primers will be and hence the 
greater the selective difference in PCR. 

(c) That the 3' terminal base of the primer should preferably be a C 
corresponding to a C known to be methylated in the target DNA (normally 
part of a CpG dinucleotide). Correct pairing with the terminal base of the 
primer will provide for highly selective priming of target sequences 
compared with unmethylated background sequences which will form a C:A 
mismatch. 

(d) That at positions where it is known that methylation occurs in only a 
fraction of molecules in the methylated target DNA or where it is known to 
vary between target DNAs (e.g. in different tumour samples), redundancy can 
be incorporated into the primers to allow for amplification of either C or T 
from the target DNA. This same approach can be used if polymorphisms are 
known to exist in the primer region. 

For the "reverse" primer, which anneals to the converted strand, As 
replace G's at positions opposite converted C's. 
(4) Verification of Selective Target Sequence Amplification 
The amplified PCR band can be analysed to verify that it has been derived 
from DNA which has been fully bisulphite-converted (i.e. C's not methylated 
in the original DNA have been converted to U's and amplified as T's) and to 
further verify that the amplified DNA has been derived from the specific 
target DNA sequence and has the expected methylation profile (i.e. 5meC's 
not converted to T's). Methods for conducting these verifications include: 
(a) Using restriction enzyme digestion. 

In order to verify complete conversion, particular restriction enzymes 
can be used to cut the DNA. The sequence recognition sites should have the 
property that they contain no C's and are present in the sequence of the 
amplified strand after but not before bisulphite treatment. Thus, the 
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conversion of one or preferably two or more C's to U's and their amplification 
as T's in the PCR product should produce a new restriction site. Useful 
enzymes are shown in italics in Table 1 below. 

In order to verify that the target DNA sequence amplified was 
specifically methylated, use can be made of restriction enzyme sites whose 
only C nucleotides are found as CpG dinucleotides and which, if the 
sequence was methylated, would remain as CpG's in the PCR products. 
Examples of such enzymes are shown in bold in Table 1 below. BsmBl, 
which cuts the non-symmetrical sequence GAGACG can also be used. 

In some instances, enzymes which contain a C as an outer base in their 
recognition sequence can be used for verification of methylation: e.g. EcoRI 
(GAATTC) for a GAATTCG sequence or Sau3AI (GATC) for a GATCG 

i 

sequence (bold and underlined in Table 1). If a site such as one of the above 
is present in the predicted methylated, fully bisulphite-converted DNA then 
the enzyme will cut the DNA only if the original CpG dinucleotide was 
methylated, confirming the amplification of a methylated region of DNA. 
Some of the enzymes (bold and underlined in Table 1) have the potential to 
be used both for monitoring efficient conversion and CpG methylation. 
(b) Differential hybridisation to specific oligonucleotides. 

Differential hybridisation to specific oligonucleotides can be used to 
discriminate that the amplified DNA is fully reacted with bisulphite and of 
the expected methylation profile. To demonstrate complete conversion, a 
pair of oligonucleotides corresponding to the same region within the 
amplified sequence is prepared. One oligonucleotide contains T's at all C's 
which should be converted by bisulphite, while the other contains C's in 
these positions. The oligonucleotides should contain at least two or three of 
such discriminatory C's and conditions be determined which provide for 
selective hybridisation of each to its target sequence. Similar 
oligonucleotides with C or T at CpG sites and T's replacing all non-CpG C's 
are used to determine whether the specific CpG sites are methylated. 
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Additional control oligonucleotides that contain no discriminatory C's, that 
is, either no C's or a minimal number where C's are substituted with Y's 
(mixture of C and T), are used to monitor the amount of PCR product in the 
sample. The oligonucleotides can be used for direct hybridisation detection 
of amplified sequences or used to select out target molecules from the PCR- 
amplified DNA population for other detection methods. An array of such 
oligonucleotides on a DNA sequencing chip can be used to establish the 
sequence of the amplified DNA throughout the sequence region. 

(c) Single nucleotide primer extension (SNuPE). 

The technique of single nucleotide primer extension can be applied to 
the PCR products to determine whether specific sites within the amplified 
sequence contain C or T bases. In this method, a primer abutting the 
position of interest is annealed to the PCR product and primer extension 
reactions performed using either just dCTP or just dTTP. The products can 
be separated by gel electrophoresis and quantitated to determine the 
proportion of each nucleotide in the population at that position. Primers 
should be designed to quantitate conversion of C's in CpG sites and control 
C's which should not be methylated. More than one primer can be included 
in a single reaction and/or run in the same gel track as long as their sizes can 
be clearly distinguished. 

(d) Fluorescent Real-time Monitoring of PCR. 
Oligonucleotides internal to the amplified region can be used to 

monitor and quantify the amplification reaction at the same time as 
demonstrating amplification of the correct sequence. In the Fluorogenic 5' 
Nuclease PCR assay (19) the amplification reaction is monitored using a 
primer which binds internally within the amplified sequence and which 
contains both a fluorogenic reporter and a quencher. When this probe is 
bound to its target DNA it can be cleaved by the 5' nuclease activity of the 
Taq polymerase, separating the reporter and the quencher. By utilising in the 
assay an oligonucleotide which is selective for the fully bisulphite-converted 
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sequence (and/or its methylation state) both the level of amplification and its 
specificity can be monitored in a single reaction. Other related systems that 
similarly detect PCR products by hybridisation can also be used. 

Example 1; Methylation sequence profile of target and non-target GST-Pi 
DNA 

MATERIALS AND METHODS 

Figure 1 shows the organisation of the GST-Pi gene and the regions for 
which genomic sequencing was used to determine the methylation status of 
DNA isolated from prostate cancer tissue or cell lines and from normal 
prostate or other tissues. The nucleotide sequence numbering in Figure 1 is 
according to the GST-Pi sequence, Genbank Accession No. M24485. Also 
shown, within the boxes is the sequence of each amplified region, with all 
the GpG sites indicated and numbered relative to the position of the 
transcription start site. Sequence analysis demonstrated that there was an 
additional CpG dinucleotide (+9) not predicted from the published sequence. 
Also identified in the regions sequenced was a polymorphism which is 
present in a significant fraction of the samples studied. The polymorphic 
allele does not contain CpG site -33. Both the additional CpG dinucleotide 
and the polymorphism are shown in Figure 2. The nucleotide coordinates in 
Figure 2 are shown relative to the transcription start site; the first base 
shown, -434, corresponds to base 781 of the Genbank sequence, while the 
last +90, corresponds to base 1313 of the Genbank sequence. 

Table 2 lists the sequences and positions of the non-selective primers 
used for amplification (Table 2-1) and direct sequencing (Table 2-2) of 
bisulphite-treated DNA. 

DNA isolated from normal prostate tissue, prostate cancer tissue, 
prostate cancer-derived cell lines and other tissues was bisulphite treated and 
PCR reactions done by standard procedures (13). PCR products were either 
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digested with informative restriction enzymes, sequenced directly (17), or 
individual molecules cloned and sequenced by standard procedures. 
RESULTS 

In Figure 3A, the methylation status of sites in DNA from prostate 
cancer cell lines, prostate cancer tissue samples and matched normal prostate 
tissue are shown for the core promoter regions through to the 3' end of the 
gene (covering CpG sites -28 to 103). It can be seen that in normal prostate 
tissue, the core promoter region is unmethylated at all sites and that this lack 
of methylation extends through the region flanking the promoter to CpG site 
+ 33. Results of restriction enzyme digests of bisulphite-treated, PCR- 
amplified DNA indicate that this lack of methylation includes CpG sites +52 
and +53. However, in the regions further downstream which were analysed, 
CpG sites +68 to +74 and +96 to +103, DNA from normal prostate tissue 
was heavily methylated. Analysis of the prostate cancer cell line LNCaP and 
prostate cancer tissue samples demonstrates extensive methylation of the 
core promoter region; variations in the overall level of methylation probably 
reflect the presence of different levels of normal cells within the tumour 
samples. DNA from one cancer sample (2AC) was found to be completely 
unmethylated and in contrast to the other tumour samples this tumour was 
found by immunohistochemistry to still be expressing GST-Pi. Sequencing 
of the region flanking the core promoter in the LNCaP cell line and tumour 
DNAs. BC and CC, showed that methylation extended through to CpG site 
+ 33 and further restriction enzyme analysis showed that methylation 
included CpG sites +52 and +53. For one tumour sample, DC, methylation 
did not extend beyond the core promoter region and CpG sites +13 to +33, 
as well as CpG sites +52 and +53 were found to be unmethylated. It is 
notable that this tumour was of Gleason Grade 2 + 2, the lowest grade tumour 
among those analysed. For all tumour DNA samples, as for the normal DNA, 
the downstream regions of the gene, sites 68 to 74 and 96 to 103 , were 
heavily methylated. Within the promoter regions which were methylated in 
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the cancer, but not normal, tissue specific individual sites were evident 
which were either unmethylated or methylated to a much lower degree than 
surrounding methylated sites. These include sites -22 and -23 (XC), -20 (PC 3 
lines, XC and WC), -14 (PC3, XC and WC), +24 (PC3-M and MM2, CC), +25 
(LNCaP, PC3-MM2, CC). 

The results shown in Figure 3B provides, a comparison of the 
methylation state of the core promoter region and sequences upstream of the 
core promoter region in DNA isolated from normal prostate tissue and from a 
number of other normal tissues. Sequences from the PCR fragment upstream 
of the core promoter were determined by cloning and sequencing as the 
region is refractory to direct sequencing. For the cancer samples, the level of 
methylation shown is as a proportion of those clones which were methylated 
(about 50% of the total clones in both cases). In normal prostate tissue as 
well as in all other normal tissues there is extensive methylation of CpG sites 
upstream of the AT-rich repeat. Downstream of the repeat (from CpG site 
-43) minimal methylation was seen in all normal tissues except normal liver 
tissue, where there was significant methylation of CpG sites -7 through to 
+ 7. Sequences upstream of the core promoter were found to be heavily 
methylated in the prostate cancer DNAs, though again specific sites were 
undermethylated; site -32 in cancers B and D and site -36 in cancer B. 

The results therefore allow for the identification of a region of the GST- 
Pi gene and its regulatory flanking sequences, stretching from 3' of the 
polymorphic repeat region, (CpG site -43) to sites +52 and +53, which is not 
methylated in normal prostate tissue but is normally highly methylated in 
prostate cancer. In one cancer sample (D, the cancer of lowest Gleason 
Grade) the region from CpG sites +13 to +53 was not methylated. The more 
restricted region extending from CpG site -43 to +10 was methylated in all of 
the prostate cancer DNAs which showed promoter methylation. Methylation 
of part of the promoter region (CpG sites -7 to +7) was also seen in one 
normal tissue (liver) examined. Analysis of further samples of normal liver 
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DNA has shown that the level of methylation is variable and can include CpG 

sites from -13 to +8. 

DISCUSSION 

The above results are critical in identifying regions within the GST-Pi 
gene and/or its regulatory flanking sequences which can be used for the 
development of assays for the selective detection of prostate cancer cells. 
Thus, the region from CpG sites -43 to +53 lying within the boundary of 
regions methylated in normal prostate tissue can be used for the design of 
primers to detect cancer-specific methylation in prostate tissue samples. The 
region from CpG site -43 to +10 is preferred for the detection of a higher 
proportion of cancers. The region from CpG sites +13 to +53 may be used to 
detect cancer but also may be used to distinguish early (unmethylated) 
cancer from later (methylated cancer). For assays using other samples, such 
as blood, it is preferred to restrict the region chosen to exclude CpG sites -7 
to +7 or, more preferably sites -13 to +8. For example, liver cells may be 
present in the blood taken from a subject suffering liver disease, in which 
case, a false positive result could be obtained if the region chosen for 
detection of cancer-specific methylation includes CpG sites -13 to +8. 

Example 2: Design and use of selective urimers for detection of methylated 
GST-Pi DNA 

MATERIALS AND METHODS 

Sequence primers for the detection of methylated GST-Pi sequences 
from three regions, namely a region upstream of the core promoter (primers 
CGPS-5 to 9 and CGPS-11 to 13), a region partially encompassing the core 
promoter (primers CGPS-1 to 4), and a region further downstream from the 
core promoter (primers CGPS-21 to 24) are shown in Table 3 below. 

The sequence and derivation of primers for the upstream region are 
shown in Figure 2 (from CpG site -43 to CpG site +10), which also shows the 
common polymorphism encompassing CpG site -33 (see above the sequence 
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(p)). Underneath is shown the sequence of the derived strand after 
conversion of cytosines to uracil. The derived strand is shown either 
assuming all CpGs are methylated (B-M) or that none are (B-U). Below this is 
shown specific primers designed to selectively amplify the methylated 
sequence. It can be seen that all primers are designed to match perfectly to 
the treated, methylated template, but contain mismatches to the template 
derived from unmethylated DNA or the original untreated DNA. Primers 
CGPS-5, 8, 11, 12 and 13 are designed to avoid the polymorphic region and 
CpG sites which show a lower frequency of methylation in prostate cancer 
DNAs. The underlined Ts in the forward primers (and A's in the reverse 
primers) derive from bisulphite conversion of C's and provide discrimination 
against amplification of DNA which has not been efficiently converted by the 
bisulphite treatment. The bold C's in the forward primers (and G's in the 
reverse primers) are parts of CpG sites and will form base pairs with DNA 
derived from methylated sequences but form mismatches to DNA derived 
from unmethylated sequences. Redundancy is included in some positions, Y 
(= mix of C and T) in forward primers and R (= mix of A and G) in reverse 
primers to allow pairing independent of methylation status. This can allow 
for certain sites where the frequency of methylation within or between 
tumour samples is variable (eg. site -14). Forward and reverse primers for 
specific selective amplification of methylated GST-Pi sequences are shown in 
Table 3 below. 

Amplifications conducted for this example, utilised bisulphite treated 
DNAs from a variety of tissues and used two sets of PCR primers. 
Specifically, for the amplification reactions shown in Figure 4A Panel A 
(region covering the transcription start site), CGPS-1 and 3 were used as outer 
primers and CGPS-2 and 4 as inner primers. For the amplification reactions 
shown in Figure 4A Panel B and Figure 4B and 4C, the outer primer pair, 
CGPS-5 and CGPS-8 which encompass the region from CpG site 
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-39 to -16, were used for first round amplification, followed by second round 
amplification with the CGPS-6 and CGPS-7 primers, resulting in the 
amplification of a 140 bp fragment covering CpG sites -36 to -23. For the 
amplification reactions shown in Figures 5 to 8, the primer set used for the 
upstream region was the outer primer pair, CGPS-5 and CGPS-8, for first 
round amplification and the inner primer pair, CGPS-11 and CGPS-12, for 
second round amplification, resulting in the amplification of a 167 bp 
fragment covering CpG sites -38 to -23. 

For all sets of primers, PCR amplifications were performed in a buffer 
consisting of 67 mM Tris/HCl, 16.6 mM ammonium sulphate, 1.7 mg/ml BSA 
and 1.5 mM MgCl 2 , prepared in TE buffer (10 mM Tris/HCl pH 8.8, 0.1 mM 
EDTA). Reaction mixes (50 ^1) contained 200 uM of each of the four dNTPs, 
6 ng/ml of each primer and 2 units of AmpliTaq DNA polymerase (Perkin 
Elmer). For the primers CGPS-5 and 8 (first round amplification), PCR cycle 
conditions were 5 cycles of 60°C 1 min., 72°C 2 min. and 95°C 1 min., 
followed by 30 cycles of 65°C 1 min., 72°C 1.5 min. and 95°C 1 min. 
Amplification conditions for the primers CGPS-6 and 7 (second round 
amplification) were 5 cycles of 65°C 1 min., 72°C 2 min. and 95°C 1 min., 
followed by 30 cycles of 65°C 1 min., 72°C 1.5 min. and 95°C 1 min. For the 
primers CGPS-11 and 12, the amplification conditions were the same as for 
the CGPS-6 and 7 primers except that the annealing temperature was raised 
from 65°C to 70°C. 2 ul of the first round amplification reactions were used 
in 50 \i\ of second round amplification reactions. Other buffers or PCR 
amplification conditions may also be used to achieve similar efficiency and 
specificity. 

RESULTS AND DISCUSSION 

For the primers covering the core promoter region (see Figure 4A Panel 
A), amplified DNA (see arrowed band) was obtained from the positive control 
DNA (cancer B) but also from DNA from prostate tissue samples from two 
subjects who had not been diagnosed with prostate cancer. Bands of 
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amplified DNA were also seen from DNA isolated from a bone marrow and 
blood sample as well as from DNA isolated from liver tissue samples from 
subjects with no known prostate cancer. 

For the upstream amplification (see Figure 4A Panel B), no amplified 
DNA was obtained from amplification reactions conducted on DNA isolated 
from a range of healthy tissue samples nor from DNA isolated from blood 
samples of subjects with no known prostate cancer; a band of amplified DNA 
was produced from the positive control DNA (cancer B). However, while 
amplification reactions conducted on DNA isolated from one normal prostate 
tissue sample did not result in amplified DNA, amplified DNA did result 
from the same amplification reactions conducted on DNA isolated from a 
prostate tissue sample of an 82 year old subject with no known prostate 
cancer. It is possible that this subject had undiagnosed prostate cancer. 
DNA isolated from five other samples of normal prostate tissue from subjects 
with no known prostate cancer did not give rise to an amplified DNA product 
(see Figure 4C Panel B). 

In Figure 4B, the results of PCR amplification reactions are shown for 
tissue samples from patients with prostate cancer: for each sample, DNA was 
isolated from a region identified as containing cancer and from another 
region identified as grossly normal. In all cases, a clear band of amplified 
DNA was produced from amplification reactions conducted on prostate 
cancer DNA. Two of these, were cases where the proportion of methylated 
DNA was insufficient to be detected using primers designed to prime 
equivalently on methylated and unmethylated DNA. For DNA isolated from 
grossly normal tissue, the band of amplified DNA was either absent or 
present in a substantially lower amount. The presence of a band in some 
"normal" samples could derive from a low level of cancer cells in the sample. 

Amplification of DNA from samples of blood obtained from the 
abdominal cavity during surgery showed that it was possible to detect 
methylated GST-Pi sequences in a number of them. Samples of peripheral 
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blood isolated from three patients with known metastatic disease (see Fig 4C 
Panel BJ demonstrated the presence of amplifiable, methylated GST-Pi 
sequences. 

Amplified DNA products were also produced from amplification of 
DNA isolated from the LNCaP and DU145 prostate cancer cell lines, but not 
from the PC-3 series of cell lines. This latter result could be due to a low 
level of methylation in the upstream promoter region in PC-3 cells, but a 
major contributing factor is likely to be a lack of priming by the CGPS-6 
primer as PC-3 only contains the variant allele of the GST-Pi gene. 
Methylated GST-Pi sequences were also detected in DNA isolated from some 
tumour-derived cell lines of non-prostatic origin: HeLa, a cervical carcinoma, 
and HepG2, a liver carcinoma ( see Figure 4C Panel B). 

DNA was isolated from the seminal fluid (see Figure 5) of 3 prostate 
cancer patients (C) and from 5 subjects with no known prostate cancer (N), 
treated with bisulphite and amplified using primers CGPS-5 and 8 followed 
by CGPS-6 and 7. Amplified DNA products were obtained from all three 
cancer DNAs. One of the five samples from subjects without diagnosed 
prostate cancer also resulted in an amplified DNA product, but it is not clear 
if this represents a false positive or a case of undiagnosed prostate cancer in 
the particular subject. 

The use of the primer CGPS-11 avoids annealing across the 
polymorphic sequence at CpG site -33, and the combination of CGPS-5 and 8 
as outer primers followed by CGPS-11 and 12 as inner primers was found to 
give efficient amplification of prostate cancer DNA. In a first experiment (see 
Figure 6), DNA was extracted from regions of fixed tissue slides that had 
been identified as either being cancerous or being diseased with benign 
hyperplasia (BPH). DNA was isolated by incubating scraped material in 400 
ul of 7M guanidinium hydrochloride, 5 mM EDTA, 100 raM Tris/HCl pH 6.4, 
1% Triton-XlOO, 50 mg/ml proteinase K and 100 mg/ml yeast tRNA. After 
homogenisation, samples were incubated for 48 hours at 55°C then subjected 
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to five freeze/thaw cycles of dry ice for 5 min./95°C for 5 min. After 
vortexing and centrifugation for 2 min. in a microfuge, the supernatants were 
then diluted three fold, extracted with phenol/chloroform and ethanol 
precipitated. DNA isolated from samples from 6 cancer patients and 4 with 
5 BPH were amplified with either non-selective primers for the core promoter 
region (i.e. control PCR amplification with GST-9 and 10 followed by GST-11 
and 12) or CG selective primers (i.e. selective PCR amplification with CGPS-5 
and 8 followed by CGPS-11 and 12). Control PCR amplifications 
demonstrated the presence of amplifiable DNA in all samples. Using the CG 

10 selective primers, amplified DNA products were only obtained from the 
cancer DNAs. The PSA (prostate specific antigen) levels of these patients 
ranged from 4 to 145 ng/ml. For the BPH patients, the PSA levels ranged 
from 2.3 to 25 ng/ml. 

In further experiments, prostate cancer cells were first enriched from 

15 blood samples using antibodies coupled to magnetic beads followed by DNA 
isolation, bisulphite modification and PCR amplification. Cell isolation was 
achieved using Dynabeads anti-Epithelial Cell (Dynal Prod. No. 112.07) 
essentially as described by the manufacturer. The magnetic beads were 
coated with the anti-epithelial antibody mAb Ber-EP4 (22). Alternatively, 

20 magnetic beads coupled to antibodies specific for the extracellular domain of 
the prostate specific membrane antigen (23) could have been used. Whole 
blood was diluted 1:1 with Dulbecco's phosphate buffered saline (PBS) 
containing 10 mM EDTA and 40 \i\ of pre-washed magnetic beads added. 
Cells were incubated at 4°C on a rotating platform for 30 min and then the 

25 beads were collected to the side of the tube using a magnetic cell separation 
device for 4 min. The supernatant was then carefully aspirated and the 
beads resuspended in the washing solution (PBS containing 0.5% bovine 
serum albumin). Beads were then again collected to the side of the tube 
using a magnet and the supernatant carefully aspirated before conducting a 

30 further wash was done with the tube remaining in place in the magnetic 
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separation device and the supernatant aspirated. The beads were then 
resuspended in DNA isolation buffer (100 mM Tris/HCl pH 8, 25 mM EDTA, 
1% Sarkosyl, 200 mg/ml proteinase K), incubated for at least 2 h at 37° and 
DNA recovered by phenol/chloroform extraction and ethanol precipitation. 
The DNA was then finally subjected to bisulphite treatment and PCR 
amplification. 

The sensitivity of this method was tested by seeding varying numbers 
of cells of a prostate cancer cell line, LNCaP, into normal blood. As shown in 
Figure 7 A, the presence of 20 cells or more in 0.5 ml of blood could be 
reliably detected. The experiment shown in Figure 7B showed that blood 
samples containing LNCaP cells could be stored at room temperature or at 
4 °C for up to 24 hours without loss of sensitivity. 

Using magnetic bead capture followed by bisulphite treatment and 
selective PCR amplification, patient blood samples were also analysed and 
the results from a set of these are shown in Figure 8. These include blood 
samples from normal subjects with no known prostate complaint, from 
patients with benign hyperplasia (BPH) of the prostate and from patients with 
histologically confirmed prostate cancer. The control PCR amplifications 
(upper panel) used primers which amplify both methylated and 
unmethylated GST-Pi sequences. The amplifications using CG-selective 
primers are shown in the lower panel. Positive control amplifications 
(LNCaP (L) and PC3 (P)) are shown in the cancer panels and negative control 
amplifications are shown in the normal and cancer panels. 

Table 4 below summarises the results of testing of DNA from patient 
blood samples using the magnetic bead/CG selective PCR amplification 
protocol. No amplified DNA products were obtained from DNA isolated from 
normal control subjects, and only DNA isolated from one of 18 patients 
diagnosed histologically to have BPH produced amplified DNA products (this 
patient had a blood PSA level of 17 ng/ml). Of patients with confirmed 
prostate cancer, isolated DNA from 17 of 24 (70%) were PCR-positive (i.e. 
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resulted in the production of amplified DNA), indicating the presence of 
prostate cancer cells in the blood. For patients clinically staged as A and B, 
(i.e. disease confined to the prostate), cancer cells were detected in the blood 
in 6 of of the 10 cases. For 9 patients with locally invasive (Stage G] or 
5 metastatic (Stage D) disease, cancer cells were detected in the blood in every 
case. 

Since it was found that the HepG2 liver cancer cell line contained 
methylated GST-Pi sequences, samples of DNA isolated from liver cancer 
tissue was also examined. DNA isolated from 20 liver cancer samples were 

10 bisulphite treated and amplified using the CGPS-5 and 8 and CGPS-11 and 12 
primer pairs (see Figure 9). 14 of the 20 samples were PCR-positive. On the 
other hand., no amplified DNA products were produced from DNA isolated 
from 2 patients with no liver cancer (see Figure 4 and data not shown). DNA 
isolated from normal liver tissue was shown to be partially methylated in the 

15 region of the transcription start site (CpG sites -7 to + 7, see Figure 3B). 

Analysis of further samples of normal liver DNA has shown that the level of 
methylation is variable and can include CpG sites from -13 to +8. The 
primer pairs used here encompass CpG sites -39 to -16, upstream of the 
region of methylation seen in normal liver DNA. 

20 The above results show that different sets of primers designed to 

hybridise the core promoter of the GST-Pi gene or the region upstream of the 
core promoter, can reliably amplify bisulphite-treated DNA that has been 
isolated from prostate cancer cells. However, primers designed to hybridise 
to the core promoter are less selective in that DNAs isolated from a number 

25 of normal tissue samples result in amplified DNA products. Thus, primers 
designed to hybridise to regions found to be unmethylated in DNA from 
normal tissues, that is, the upstream region encompassing CpG sites -45 to -8 
and the region downstream of the promoter encompassing CpG sites +8 to 
+53, are preferred for the prognostic or diagnostic assaying of prostate 
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cancer. Additionally, primers designed to hybridise to this latter region may 
also be useful for discriminating between early and late prostate cancer. 

Example 3: Confirmation of correct amplification 

The specific oligonucleotides probes described below can be used to 
confirm that any amplified DNA products resulting from the amplification 
step of the assay is due to DNA in which all unmethylated cytosines had 
been converted to uracils. Those for the upstream PCR region can be used 
with amplified DNA products from all combinations of the CGPS-5, 6, 11, 7 
to 9, 12 and 13 forward and reverse primers. Those for the downstream PCR 
region can be used with amplified DNA products of the CGPS-21 to 24 
primers. A biotinylated version of the conversion-specific olignucleotide can 
also be used for the selective and specific capture from solution of the 
amplified DNA products generated using these primer pairs, or the 
appropriately labelled oligonucleotide can be use for real-time monitoring of 
specific PCR fragment amplification. Amplified DNA products from PCR 
amplification of bisulphite-treated DNA routinely have one strand containing 
a very high proportion of thymine nucleotides and the other strand 
containing a very high proportion of adenine nucleotides. Because of this, it 
is possible to use oligo dT (or oligo dA) as a generic conversion specific 
oligonucleotide, the annealing conditions being varied to optimise 
discrimination of converted and non-converted DNA for each PCR fragment. 

Upstream PCR region: 
Conversion oligonucleotide: 

HybC5 5'-AAACCTAAAAAATAAACAAACAA (SEQIDNO:17) 
Non-conversion oligonucleotide: 

HybU5 S'-GGGCCTAGGGAGTAAACAGACAG (SEQIDNO: 18) 
Conversion neutral oligonucleotide: 

HybN5: 5'-CCTTTCCCTCTTTCCCARRTCCCCA (SEQID NO: 19) 
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Downstream FCR region: 
Conversion oligonucleotide: 

HyBC3 5 '-TTTGGTATTTTTTTTCGGGTTTTAG (SEQID NO: 20) 
Non-conversion oligonucleotide: 

HybU3 5'-CTTGGCATCCTCCCCCGGGCTCCAG (SEQ ID NO: 21) 
Conversion neutral oligonucleotide: 

HybN3 5'-GGYAGGGAAGGGAGGYAGGGGYTGGG (SEQ ID NO: 22) 

To demonstrate the selectivity of such hybridisations, a series of DNAs 
were spotted onto nylon membranes and hybridised with conversion and 
non-conversion specific oligonucleotide probes for the upstream PCR region 
as well as a control oligonucleotide. The DNAs included: 

(i) individual cloned PCR products from amplification of the upstream region 
that contained differing numbers of converted cytosines in the region 
complementary to the probe (see Figure 10, where the number of converted 
cytosines, out of 10, is shown (Column 1 and top 2 spots of Column 2). n.b. 
the two clones containing 10/10 converted bases end adjacent to and do not 
contain the sequences complementary to the control oligonucleotide); and 

(ii) PCR products from cancer patients and patients with benign hyperplasia 
that had been amplified from bisulphite-treated DNA using CG-selective 
primers (CGPS-5 and 8, followed by CGPS 11 and 12) (see figure 10, where 
these are labelled as Cancer Samples 1 to 4 (lower part of column 2) and BPH 
samples 1 to 4 (Column 3)). 

Hybridisations with kinased oligonucleotide probes were performed in 
Express-Hyb buffer (Clontech) at 45°C for two hours followed by four 20 min. 
washes in 2X SSC, 0.1% SDS at 45°C before phosphorimage analysis. 

Hybridisations with the control oligonucleotide probes provides an 
estimate of the amount of DNA in the sample. As expected, none of the PCR 
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amplifications of BPH samples produced significantly detectable product, 
while 3 of 4 cancer samples gave a strong signal and one a very weak one. 

Hybridisations with the conversion-specific probe showed a clear 
signal for the plasmid DNAs that matched the probe perfectly and for the 3 
cancer samples for which there was stronger hybridisation with the control 
oligonucleotide probe. The fourth cancer sample that gave a very weak 
signal with the control oligonucleotide was barely detectable with the 
conversion-specific probe. This could have been due to the low level of DNA 
or, possibly, the presence of partially-converted DNA molecules. None of the 
plasmid clones that had mismatches to the conversion-specific probe gave a 
significant signal. The probe for unconverted DNA hybridised clearly with 
plasmid DNAs that had 0, 1 or 2 bases converted, but not with samples that 
had 8 or 10 converted bases. The hybridisations also indicated that there was 
a low level amplification of unconverted DNA in two BPH and one cancer 
sample (in this latter case there was a strong signal from probe for fully 
converted DNA, indicating that the PCR product was predominantly derived 
from properly converted DNA). 

The results show that oligonucleotides of the type used here can 
discriminate between molecules that have been efficiently converted by 
bisulphite and those that have not. They can be used in a number of formats 
for detection of PCR products or prior to PCR or other detection methods to 
select out efficiently converted molecules of the target region from the total 
DNA population. The same approach can be used with primers that 
distinguish CpG methylated DNAs (or their derivatives containing C's) from 
unmethylated DNAs (containing U's or their derivatives containing T's). 
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Primer Sequence [5'-3') 


CGCGAGGlTlTCGTlUiAGTITCGTCGTC (SEQ ID NO: 1] 


CGTTATTAGTGAGTACGCGCGGTTC (SEQ ID NO: 2) 


TCCCATCCCTCCCCGAAACGCTCCG (SEQ ID NO: 8) 


GAAACGCTCCGAACCCCCTAAAAACCGCTAACG (SEQ ID NO: 9) 




YGGTTTTAGGGAATTTTTTTTCGC (SEQ DO NO: 3) 


YGGYGYGTTAGTTYGTTGYGTATATTTC (SEQ ID NO: 4) 


GGGAATITITITI'CGCGATGTTTYGGCGC (SEQ ID NO: 5) 


CRCCCTAAAATCCCCRAAATCRCCGCG (SEQ ID NO: 10) 


ACCCCRACRACCRCTACACCCCRAACGTCG (SEQ ID NO: 11) 


CTCTTCTAAAAAATCCCRCRAACTCCCGCCG (SEQ ID NO: 12) 


AAAACRCCCTAAAATCCCCGAAATCGCCG (SEQ ID NO: 13) 


AACTCCCRCCGACCCCAACCCCGACGACCG (SEQ ID NO: 14) 
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CN 
rH 


CO 
rH 






rH 

CN 


CN 
CN 


CO 
tN 


CN 


Prim 




CGPS- 


CGPS- 


CGPS- 


CGPS 




CGPS- 


CGPS- 


CGPS- 


CGPS- 


CGPS- 


CGPS- 


CGPS- 


CGPS- 




CGPS- 


CGPS- 


CGPS- 


CGPS- 
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Assay negative 


Assay positive 








Normal subjects 


10 


0 








Benign hyperplasia 


17 


1 








Cancer (total) 


7 


17 








Stage A 


1 


3 


Stage B 


3 


3 


Stage C 


0 


2 


Stage D 


0 


7 


Stage not defined 


3 


2 
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It will be appreciated by persons skilled in the art that numerous 
variations and/or modifications may be made to the invention as shown in 
the specific embodiments without departing from the spirit or scope of the 
5 invention as broadly described. The present embodiments are, therefore, to 
be considered in all respects as illustrative and not restrictive. 
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Sequence Listings: 

Applicant: Commonwealth Scientific and Industrial Research 
Organisation 

Title: Diagnostic assay 

Prior Application Number: PP3129 

Prior Application Filing Date: 1998-04-23 

Number of SEQ ID NOs : 59 

Software: Patentln Ver. 2.1 

SEQ ID NO: 1 
Length: 2 9 
Type : DNA 

Organism: Homo sapiens 
Sequence: 1 

cgcgaggttt tcgttggagt ttcgtcgtc 29 

SEQ ID NO: 2 
Length: 25 
Type : DNA 

Organism: Homo sapiens 
Sequence: 2 

cgttattagt gagtacgcgc ggttc 25 

SEQ ID NO: 3 
Length: 24 
Type: DNA 

Organism: Homo sapiens 
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Sequence: 3 

yggttttagg gaattttttt tcgc 24 

SEQ ID NO: 4 
Length: 28 
Type: DNA 

Organism: Homo sapiens 
Sequence: 4 

yggygygtta gttygttgyg tatatttc 28 



SEQ ID NO: 5 
Length: 2 9 
Type: DNA 

Organism: Homo sapiens 
Sequence: 5 

gggaattttt tttcgcgatg tttyggcgc 

SEQ ID NO: 6 
Length: 2 4 
Type: DNA 

Organism: Homo sapiens 



29 



Sequence: 6 

tttttagggg gttyggagcg tttc 24 

SEQ ID NO: 7 
Length: 19 
Type : DNA 

Organism: Homo sapiens 
Sequence: 7 

ggtaggttgy gtttatcgc 19 



SEQ ID NO: 8 
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Length: 27 
Type : DNA 

Organism: Homo sapiens 
Sequence: 8 

aaaaattcra atctctccga ataaacg 27 

SEQ ID NO: 9 
Length: 27 
Type : DNA 

Organism: Homo sapiens 
Sequence: 9 

aaaaaccraa ataaaaacca cacgacg 27 

SEQ ID NO: 10 
Length: 25 
Type : DNA 

Organism: Homo sapiens 
Sequence: 10 

tcccatccct ccccgaaacg ctccg 25 

SEQ ID NO: 11 y 
Length: 33 
Type: DNA 

Organism: Homo sapiens 
Sequence: 11 

gaaacgctcc gaacccccta aaaaccgcta acg 33 

SEQ ID NO: 12 
Length: 27 
Type: DNA 

Organism: Homo sapiens 
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Sequence: 12 

crccctaaaa tccccraaat crccgcg 27 



SEQ ID NO: 13 
Length: 30 
Type : DNA 

Organism: Homo sapiens 
Sequence: 13 

accccracra ccrctacacc ccraacgtcg 30 



SEQ ID NO: 14 
Length: 31 
Type: DNA 

Organism: Homo sapiens 
Sequence: 14 

ctcttctaaa aaatcccrcr aactcccgcc g 31 

SEQ ID NO: 15 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
Sequence: 15 

aaaacrccct aaaatccccg aaatcgccg 29 

SEQ ID NO: 16 
Length: 30 
Type : DNA 

Organism: Homo sapiens 
Sequence: 16 

aactcccrcc gaccccaacc ccgacgaccg 30 



SEQ ID NO: 17 



WO 99/55905 



PCT/AU99/00306 



49 

Length: 23 
Type : DNA 

Organism: Artificial Sequence 
Feature: 

Other Information: Description of Artificial 
Sequence : Oligonucleotide 

which binds bisulf ite-converted human GST-Pi gene 

Sequence: 17 

aaacctaaaa aataaacaaa caa 23 

SEQ ID NO: 18 
Length: 23 
Type : DNA 

Organism: Artificial Sequence 
Feature : 

Other Information: Description of Artificial 
Sequence : Oligonucleotide 

which binds non-converted human GST-Pi gene 

Sequence: 18 

gggcctaggg agtaaacaga cag 23 

SEQ ID NO: 19 
Length: 25 
Type : DNA 

Organism: Artificial Sequence 
Feature: 

Other Information: Description of Artificial 
Sequence : Oligonucleotide 

which binds human GST-Pi gene 



Sequence: 19 
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cctttccctc tttcccarrt cccca 25 

SEQ ID- NO: 20 
Length: 25 
Type : DNA 

Organism: Artificial Sequence 
Feature: 

Other Information: Description of Artificial 
Sequence Oligonucleotide 

which binds bisulf ite-converted human GST-Pi gene 

Sequence: 20 

tttggtattt tttttcgggt tttag 25 

SEQ ID NO: 21 
Length: 25 
Type : DNA 

Organism: Artificial Sequence 
Feature : 

Other Information: Description of Artificial 
Sequence : Oligonucleotide 

which binds non-converted human GST-Pi gene 

Sequence: 21 

cttggcatcc tcccccgggc tccag 25 

SEQ ID NO: 22 
Length: 26 
Type : DNA 

Organism: Artificial Sequence 



Feature : 
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Other Information: Description of Artificial 
Sequence : Oligonucleotide 

which binds human GST-Pi gene 



Sequence: 22 

ggyagggaag ggaggyaggg gytggg 

SEQ ID NO: 23 
Length: 31 
Type : DNA 

Organism: Homo sapiens 
Sequence: 23 

ttatgtaata aatttgtata ttttgtatat g 



26 



31 



SEQ ID NO: 24 
Length: 25 
Type : DNA 

Organism: Homo sapiens 
Sequence: 24 

tgtagattat ttaaggttag gagtt 

SEQ ID NO: 25 
Length: 27 
Type : DNA 

Organism: Homo sapiens 
Sequence: 25 

aaacctaaaa aataaacaaa caacaaa 

SEQ ID NO: 26 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
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aaaaaacctt tccctctttc ccaaatccc 29 

SEQ ID NO: 27 
Length: 27 
Type: DNA 

Organism: Homo sapiens 
Sequence: 27 

tttgttgttt gtttattttt taggttt 27 

SEQ ID NO: 28 
Length: 2 6 
Type: DNA 

Organism: Homo sapiens 
Sequence: 2 8 

gggatttggg aaagagggaa aggttt 26 

SEQ ID NO: 29 
Length: 2 4 
Type : DNA 

Organism: Homo sapiens 
Sequence: 29 

actaaaaact ctaaacccca tccc 24 

SEQ ID NO: 30 
Length: 24 
Type: DNA 

Organism: Homo sapiens 



Sequence: 30 

aacctaatac taccttaacc ccat 



24 
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SEQ ID NO: 31 
Length: 33 
Type: DNA 

Organism: Homo sapiens 
Sequence: 31 

aatcctcttc ctactatcta tttactccct aaa 33 

SEQ ID NO:* 32 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
Sequence: 32 

aaaacctaaa aaaaaaaaaa aaacttccc 29 

SEQ ID NO: 33 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
Sequence: 33 

ttggttttat gttgggagtt ttgagtttt 29 

SEQ ID NO: 34 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
Sequence: 34 

ttttgtgggg agttggggtt tgatgttgt 29 

SEQ ID NO: 35 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
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Sequence: 35 

ggtttagagt ttttagtatg gggttaatt 29 

SEQ ID NO: 36 
Length: 20 
Type : DNA 

Organism: Homo sapiens 
Sequence: 36 

tagtattagg ttagggtttt 20 

SEQ ID NO: 37 
Length: 2 9 
Type : DNA 

Organism: Homo sapiens 
Sequence: 37 

aactctaacc ctaatctacc aacaacata 29 

SEQ ID NO: 38 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
Sequence: 38 

caaaaaactt taaataaacc ctcctacca 29 



SEQ ID NO: 39 
Length: 32 
Type: DNA 

Organism: Homo sapiens 
Sequence: 39 

gttttgtggt taggttgttt ttcaggtgtt ag 



32 
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SEQ ID NO: 40 
Length: 30 
Type: DNA 

Organism: Homo sapiens 
Sequence: 40 

gttttgagta tttgttgtgt ggtagttttt 30 



SEQ ID NO: 41 
Length: 30 
Type : DNA 

Organism: Homo sapiens 
Sequence: 41 

ttaatataaa taaaaaaaat atatttacaa 30 

SEQ ID NO: 42 
Length: 34 
Type: DNA 

Organism: Homo sapiens 
Sequence: 42 

caacccccaa tacccaaccc taatacaaat actc 34 

SEQ ID NO: 43 
Length: 26 
Type : DNA 

Organism: Homo sapiens 
Sequence: 43 

ggttttagtt tttggttgtt tggatg 26 



SEQ ID NO: 44 
Length: 26 
Type: DNA 
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Organism: Homo sapiens 



Sequence: 44 

tttttttgtt tttagtatat gtgggg 

SEQ ID NO: 45 
Length: 30 
Type: DNA 

Organism: Homo sapiens 
Sequence: 45 

atactaaaaa aactattttc taatcctcta 

SEQ ID NO: 4 6 
Length: 29 
Type : DNA 

Organism: Homo sapiens 



Sequence: 4 6 

ccaaactaaa aactccaaaa aaccactaa 

SEQ ID NO: 47 
Length: 38 
Type : DNA 

Organism: Artificial Sequence 



Feature: 

Other Information: Description of Artificial Sequence: M13-human 
GST-Pi oligonucleotide 

Sequence : 47 

tgtaaaacga cggccagtgg gatttgggaa agagggaa 



SEQ ID NO: 48 
Length: 38 
Type : DNA 
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Organism: Artificial Sequence 
Feature : 

Other Information: Description of Artificial Sequence: M13-human 
GST-Pi oligonucleotide 

Sequence: 4 8 

tgtaaaacga cggccagttg ttgggagttt tgagtttt 38 

SEQ ID NO: 49 
Length: 31 
Type : DNA 

Organism: Artificial Sequence 
Feature : 

Other Information: Description of Artificial Sequence: M13-human 
GST-Pi oligonucleotide 

Sequence: 49 

tgtaaaacga cggccagtta gtattaggtt a 31 

SEQ ID NO: 50 
Length: 37 
Type: DNA 

Organism: Artificial Sequence 



Feature: 



Other Information : 



Description of Artificial Sequence: M13-human 



GST-Pi oligonucleotide 



Sequence: 50 



tgtaaaacga cggccagtgt tttgagtatt tgttgtg 



37 



SEQ ID NO: 51 
Length: 35 
Type: DNA 
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Organism: Artificial Sequence 



Feature : 

Other Information: Description of Artificial Sequence: M13-human 
GST-Pi oligonucleotide 

Sequence: 51 

tgtaaaacga cggccagtgt ttttagtata tgtgg 35 

SEQ ID NO: 52 
Length: 4 99 
Type : DNA 

Organism: Homo sapiens 
Sequence: 52 

tgcagatcac ctaaggtcag gagttcgaga ccagcccggc caacatggtg aaaccccgtc 60 
tctactaaaa atacaaaaat cagccagatg tggcacgcac ctataattcc acctactcgg 120 
gaggctgaag cagaattgct tgaacccgag aggcggaggt tgcagtgagc cgccgagatc 180 
gcgccactgc actccagcct gggccacagc gtgagactac gtcataaaat aaaataaaat 240 
aacacaaaat aaaataaaat aaaataaaat aaaataaaat aataaaataa aataaaataa 300 
aataaaataa aataaaataa agcaatttcc tttcctctaa gcggcctcca cccctctccc 360 
ctgccctgtg aagcgggtgt gcaagctccg ggatcgcagc ggtcttaggg aatttccccc 420 
cgcgatgtcc cggcgcgcca gttcgctgcg cacacttcgc tgcggtcctc ttcctgctgt 480 
ctgtttactc cctaggccc 499 



SEQ ID NO: 53 
Length: 316 
Type : DNA 

Organism: Homo sapiens 



Sequence: 53 

gggacctggg aaagagggaa aggcttcccc 
agggcgcccc tctgcggccg acgcccgggg 
gagtccgcgg gaccctccag aagagcggcc 
gggcgggacc acccttataa ggctcggagg 
gcagtcttcg ccaccagtga gtacgcgcgg 



ggccagctgc gcggcgactc cggggactcc 60 
tgcagcggcc gccggggctg gggccggcgg 120 
ggcgccgtga ctcagcactg gggcggagcg 180 
ccgcgaggcc ttcgctggag tttcgccgcc .240 
cccgcgtccc cggggatggg gctcagagct 300 
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cccagcatgg ggccaa 3 16 

SEQ ID NO: 54 
Length: 603 
Type ; DNA 

Organism: Homo sapiens 
Sequence: 54 

cagcatcagg cccgggctcc cggcagggct cctcgcccac ctcgagaccc gggacggggg 60 
cctaggggac ccaggacgtc cccagtgccg ttagcggctt tcagggggcc cggagcgcct 120 
cggggaggga tgggaccccg ggggcgggga gggggggcag gctgcgctca ccgcgccttg 180 
gcatcctccc ccgggctcca gcaaactttt ctttgttcgc tgcagtgccg ccctacaccg 240 
tggtctattt cccagttcga ggtaggagca tgtgtctggc agggaaggga ggcaggggct 300 
ggggctgcag cccacagccc ctcgcccacc cggagagatc cgaaccccct tatccctccg 360 
tcgtgtggct tttaccccgg gcctccttcc tgttccccgc ctctcccgcc atgcctgctc 420 
cccgccccag tgttgtgtga aatcttcgga ggaacctgtt tacctgttcc ctccctgcac 480 
tcctgacccc tccccgggtt gctgcgaggc ggagtcggcc cggtccccac atctcgtact 540 
tctccctccc cgcaggccgc tgcgcggccc tgcgcatgct gctggcagat cagggccaga 600 



SEQ ID NO: 55 
Length: 266 
Type : DNA 

Organism: Homo sapiens 



Sequence: 55 

gctctgagca cctgctgtgt ggcagtctct 

tcccaggctg gggctcacag acagccccct 

atcaggcgcc cagtcacgcg gcctgctccc 

gaccagcagg aggcagccct ggtggacatg 

aaatacatct ccctcatcta caccaa 

SEQ ID NO: 56 
Length: 287 
Type : DNA 

Organism: Homo sapiens 



catccttcca cgcacatcct cttcccctcc 60 
ggttggccca tccccagtga ctgtgtgttg 120 
ctccacccaa ccccagggct ctatgggaag 180 
gtgaatgacg gcgtggagga cctccgctgc 240 

266 
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Sequence: 56 

tccccctgct ctcagcatat gtggggcgcc tcagtgcccg gcccaagctc aaggccttcc 60 
tggcctcccc tgagtacgtg aacctcccca tcaatggcaa cgggaaacag tgagggttgg 120 
ggggactctg agcgggaggc agagtttgcc ttcctttctc caggaccaat aaaatttcta 180 
agagagctac tatgagcact gtgtttcctg ggacggggct taggggttct cagcctcgag 240 
gtcggtggga gggcagagca gaggactaga aaacagctcc tccagca 287 

SEQ ID NO: 57 
Length: 524 
Type: DNA 

Organism: Homo sapiens 



Sequence: 57 

ataaaataaa ataaaataaa ataaagcaat 
tcccctgccc tgtgaagcgg gtgtgcaagc 
cccccgcgat gtcccggcgc gccagttcgc 
ctgtctgttt actccctagg ccccgctggg 
ccagctgcgc ggcgactccg gggactccag 
cagcggccgc cggggctggg gccggcggga 
cgccgtgact cagcactggg gcggagcggg 
gcgaggcctt cgctggagtt tcgccgccgc 
cgcgtccccg gggatggggc tcagagctcc 

SEQ ID NO: 5 8 
Length: 524 
Type: DNA 

Organism: Homo sapiens 



ttcctttcct ctaagcggcc tccacccctc 60 
tccgggatcg cagcggtctt agggaatttc 120 
tgcgcacact tcgctgcggt cctcttcctg 180 
gacctgggaa agagggaaag gcttccccgg 240 
ggcgcccctc tgcggccgac gcccggggtg 300 
gtccgcggga ccctccagaa gagcggccgg 360 
gcgggaccac ccttataagg ctcggaggcc 420 
agtcttcgcc accagtgagt acgcgcggcc 480 
cagcatgggg ccaa 524 



Sequence: 58 

ataaaataaa ataaaataaa ataaagtaat 
ttttttgttt tgtgaagtgg gtgtgtaagt 
tttttgtgat gttttggtgt gttagtttgt 
ttgtttgttt attttttagg ttttgttggg 
ttagttgtgt ggtgattttg gggattttag 
tagtggttgt tggggttggg gttggtggga 



tttttttttt ttaagtggtt tttatttttt 60 
tttgggattg tagtggtttt agggaatttt 120 
tgtgtatatt ttgttgtggt tttttttttg 180 
gatttgggaa agagggaaag gtttttttgg 240 
ggtgtttttt tgtggttgat gtttggggtg 300 
gtttgtggga ttttttagaa gagtggttgg 360 
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tgttgcgatt tagtattggg gtggagtggg gtgggattat ttttataagg tttggaggtt 420 
gtgaggtttt tgttggagtt ttgttgttgt agtttttgtt attagtgagt atgtgtggtt 480 
tgtgtttttg gggatggggt ttagagtttt tagtatgggg ttaa 524 

SEQ ID NO: 59 
Length: 524 
Type: DNA 

Organism: Homo sapiens 



Sequence: 59 

ataaaataaa ataaaataaa ataaagtaat 
ttttttgttt tgtgaagcgg gtgtgtaagt 
ttttcgcgat gtttcggcgc gttagttcgt 
ttgtttgttt attttttagg tttcgttggg 
ttagttgcgc ggcgatttcg gggattttag 
tagcggtcgt cggggttggg gtcggcggga 
cgtcgtgatt tagtattggg gcggagcggg 
gcgaggtttt cgttggagtt tcgtcgtcgt 
cgcgttttcg gggatggggt ttagagtttt 



tttttttttt ttaagcggtt tttatttttt 60 
ttcgggatcg tagcggtttt agggaatttt 120 
tgcgtatatt tcgttgcggt tttttttttg 180 
gatttgggaa agagggaaag gttttttcgg 240 
ggcgtttttt tgcggtcgac gttcggggtg 300 
gttcgcggga ttttttagaa gagcggtcgg 360 
gcgggattat ttttataagg ttcggaggtc 420 
agttttcgtt attagtgagt acgcgcggtt 480 
tagtatgggg ttaa 524 
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Claims: 

1. A diagnostic or prognostic assay for a disease or condition in a subject, 
said disease or condition characterised by abnormal methylation of cytosine 
at a site or sites within the glutathione-S-transferase (GST) Pi gene and/or its 
regulatory flanking sequences, wherein said assay comprises the steps of; 

(i) isolating DNA from said subject, 

(ii) exposing said isolated DNA to reactants and conditions for the 
amplification of a target region of the GST-Pi gene and/or its regulatory 
flanking sequences which includes a site or sites at which abnormal cytosine 
methylation characteristic of the disease or condition occurs, the 
amplification being selective in that it only amplifies the target region if the 
said site or sites at which abnormal cytosine methylation occurs is/are 
methylated, and 

(iii) determining the presence of amplified DNA. 

2. An assay according to claim 1, wherein the amplifying step is used to 
amplify a target region within the GST-Pi gene and/or its regulatory flanking 
sequences, wherein the regulatory flanking sequences consist of the 400 
nucleotide sequence immediately 5' of the transcription start site of the GST- 
Pi gene and the 100 nucleotide sequence immediately 3' of the transcription 
stop site of the GST-Pi gene. 

3. An assay according to claim 1 or 2, wherein the amplifying step is used 
to amplify a target region within the region of the GST-Pi gene and its 
regulatory flanking sequences defined by (and inclusive of) CpG sites -43 to 
+55. 

4: An assay according to any one of the preceding claims, wherein prior 
to the amplifying step, the isolated DNA is treated such that unmethylated 
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cytosines are converted to uracil or another nucleotide capable of forming a 
base pair with adenine while methylated cytosines are unchanged or are 
converted to a nucleotide capable of forming a base pair with guanine. 

5. As assay according to any one of the preceding claims, wherein the 
amplifying step involves polymerase chain reaction (PCR) amplification. 

6. An assay according to claim 5, wherein said PCR amplification utilises 
a reverse primer including guanine at at least one site whereby, upon the 
reverse primer annealing to the treated DNA, said guanine will either form a 
base pair with a methylated cytosine (or another nucleotide to which the 
methylated cytosine has been converted through said treatment) if present, or 
will form a mismatch with uracil (or another nucleotide to which 
unmethylated cytosine has been converted through said treatment). 

7. An assay according to claim 6, wherein said PCR amplification utilises 
a forward primer including cytosine at at least one site(s) corresponding to 
cytosine nucleotides that are abnormally methylated in the DNA of a subject 
with the disease or condition being assayed. 

8. An assay according to claim 7, wherein the primers are of 12 to 30 
nucleotides in length. 

9. An assay according to claim 8, wherein the primers are selected so as 
to anneal to a sequence within the target region that includes two to four 
cytosine nucleotides that are abnormally methylated in the isolated DNA of a 
subject with the disease or condition being assayed. 

10. An assay according to claim 4, wherein the treatment of the isolated 
DNA involves reacting the isolated DNA with bisulphite. 
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11. As assay according to claim 10, wherein the amplifying step involves 
polymerase chain reaction (PCR) amplification. 

12. An assay according to claim 11, wherein said PCR amplification 
utilises a reverse primer including guanine at at least one site whereby, upon 
the reverse primer annealing to the treated DNA, said guanine will either 
form a base pair with a methylated cytosine if present, or will form a 
mismatch with uracil. 

13. An assay according to claim 12, wherein said PCR amplification 
utilises a forward primer including cytosine at at least one site(s] 
corresponding to cytosine nucleotides that are abnormally methylated in the 
isolated DNA of a subject with the disease or condition being assayed. 

14. An assay according to claim 13, wherein the primers are of 12 to 30 
nucleotides in length. 

15. An assay according to claim 14, wherein the primers are selected so as 
to anneal to a sequence within the target region that includes two to four 
cytosine nucleotides that are abnormally methylated in the DNA of a subject 
with the disease or condition being assayed. 

16. An assay according to any one of the preceding claims, wherein said 
DNA is isolated from cells from tissue, blood (including serum and plasma), 
semen, urine, lymph or bone marrow. 

17. An assay according to any one of the preceding claims, wherein the 
disease or condition to be assayed is selected from cancers. 
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18. An assay according to claim 17, wherein the disease or condition to be 
assayed is selected from prostate cancer, breast cancer, cervical cancer and 
liver cancer. 

5 19. An assay according to claim 18, wherein the disease or condition to be 
assayed is prostate cancer. 

20. An assay according to claim 19, wherein the amplifying step is used to 
amplify a target region within the region of the GST-Pi gene and its 

10 regulatory flanking sequences defined by (and inclusive of) CpG sites -43 to 
+53. 

21. An assay according to claim 19, wherein the amplifying step is used to 
amplify a target region within the region of the GST-Pi gene and its 

15 regulatory flanking sequences defined by (and inclusive of) CpG sites -43 to 
+ 10. 

22. An assay according to claim 19, wherein the amplifying step is used to 
amplify a target region within the region of the GST-Pi gene and its 

20 regulatory flanking sequences defined by (and inclusive of) CpG sites -43 to 
-14. 

23. An assay according to claim 19, wherein the amplifying step is used to 
amplify a target region within the region of the GST-Pi gene and its 

25 regulatory flanking sequences defined by (and inclusive of) CpG sites -43 to 
-8. 

24. An assay according to claim 19, wherein the amplifying step is used to 
amplify a target region within the region of the GST-Pi gene and its 
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regulatory flanking sequences defined by (and inclusive of) CpG sites +9 to 
+53. 

25. An assay according to claim 19, wherein the amplifying step is used to 
amplify a target region within the region of the GST-Pi gene and its 
regulatory flanking sequences defined by (and inclusive of) CpG sites +1 to 
+53. 

26. An assay according to claim 19, wherein the amplifying step involves 
PCR amplification using primer pairs consisting of a foward and reverse 
primer selected from each of the following groups: 

Forward Primers 

CGCGAGGTTTTCGTTGGAGTTTCGTCGTC (SEQ ID NO: 1) 

CGTTATTAGTGAGTACGCGCGGTTC (SEQ ID NO: 2) 

YGGTTTTAGGGAATTTTTTTTCGC (SEQ ID NO: 3) 

YGGYGYGTTAGTTYGTTGYGTATATTTC (SEQ ID NO: 4) 

GGGAATTTTTTTTCGCGATGTTTYGGCGC (SEQ ID NO: 5) 

TTTTTAGGGGGTTYGGAGCGTTTC (SEQ ID NO: 6) 

GGTAGGTTGYGTTTATCGC ( S EQ ID NO: 7) 

Reverse Primers 

TCCCATCCCTCCCCGAAACGCTCCG (SEQ ID NO: 8) 

GAAACGCTCCGAACCCCCTAAAAACCGCTAACG (SEQ ID NO: 9) 
CRCCCTAAAATCCCCRAAATCRCCGCG (SEQ ID NO: 10) 

ACCCCRACRACCRCTACACCCCRAACGTCG (SEQ ID NO: 11) 

CTCTTCTAAAAAATCCCRCRAACTCCCGCCG (SEQ ID NO: 12) 
AAAACRCCCTAAAATCCCCGAAATCGCCG (SEQ ID NO: 13) 

AACTCCCRCCGACCCCAACCCCGACGACCG (SEQ ID NO: 14) 

AAAAATTCRAATCTCTCCGAATAAACG (SEQ ID NO: 15) 

AAAAACCRAAATAAAAACCACACGACG (SEQ ID NO: 16), 

wherein Y is C, T or a mixture thereof, and R is A, G or a mixture thereof. 
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27. An assay according to claim 19, wherein the amplifying step involves 
PCR amplification using primer pairs consisting of a foward and reverse 
primer selected from each of the following groups: 

Forward Primers 

CGCGAGGTTTTCG1TGGAGTTTCGTCGTC (SEQ ID NO: 1) 

CGTTATTAGTGAGTACGCGCGGTTC (SEQ ID NO: 2) 

Reverse Primers 

TCCCATCCCTCCCCGAAACGCTGCG (SEQ ID NO: 8) 

GAAACGCTCCGAACCCCCTAAAAACCGCTAACG (SEQ ID NO: 9). 

28. An assay according to claim 19, wherein the amplifying step involves 
PCR amplification using primer pairs consisting of a foward and reverse 
primer selected from each of the following groups: 

Forward Primers 

YGGTTTTAGGGAATTTTTTTTCGC (SEQ ID NO: 3) 

YGGYGYGTTAGTTYGTTGYGTATATTTC (SEQ ID NO: 4) 

GGGAATTTTTTTTCGCGATGTTTYGGCGC (SEQ ID NO: 5) 
Reverse Primers 

CRCCCTAAAATCCCCRAAATCRCCGCG (SEQ ID NO: 10) 

ACCCCRACRACCRCTACACCCCRAACGTCG (SEQ ID NO: 11) 

CTCTTCTAAAAAATCCCRCRAACTCCCGCCG (SEQ ID NO: 12) 

AAAACRCCCTAAAATCCCCGAAATCGCCG (SEQ ID NO: 13) 

AACTCCCRCCGACCCCAACCCCGACGACCG (SEQ ID NO: 14), 
wherein Y is C, T or a mixture thereof and R is A, G or a mixture thereof. 



29. An assay according to claim 19, wherein the amplifying step involves 
PCR amplification using primer pairs consisting of a forward and reverse 
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primer selected from each of the following groups: 
Forward Primers 

1TTTTAGGGGGTTYGGAGCGTTTC 
GGTAGGTTGYGTTTATCGC 



(SEQ ID NO: 6) 
(SEQ ID NO: 7) 



Reverse Primers 



AAAAATTCRAATCTCTCCGAATAAACG 
AAAAACCRAAATAAAAACCACACGACG 



(SEQ ID NO: 15) 
(SEQ ID NO: 16), 



wherein Y is C, T or a mixture thereof, and R is A, G or a mixture thereof. 

30. An assay according to claim 18, wherein the disease or condition to be 
assayed is liver cancer. 

31. An assay according to claim 30, wherein the amplifying step is used to 
amplify a target region within the region of the GST-Pi gene and its 
regulatory flanking sequences defined by (and inclusive of) CpG sites -43 to 
-14. 

32. An assay according to claim 30, wherein the amplifying step is used to 
amplify a target region within the region of the GST-Pi gene and its 
regulatory flanking sequences defined by (and inclusive of) CpG sites +9 to 
+53. 

33. A diagnostic or prognostic assay for a disease or condition in a subject 
said disease or condition characterised by abnormal methylation of cytosine 
at a site or sites within the glutathione-S-transferase (GST) Pi gene and/or its 
regulatory flanking sequences, wherein said assay comprises the steps of; 

(i) isolating DNA from said subject, and 

(ii) determining the presence of abnormal methylation of cytosine at a site 
or sites within the region of the GST-Pi gene and/or its regulatory flanking 
sequences defined by (and inclusive of) CpG sites -43 to +55. 
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34. An assay according to claim 33, wherein the region of the GST-Pi gene 
and its regulatory flanking sequences within which the presence of 
methylated cytosine(s) at a site or sites is determined is selected from the 
regions defined by (and inclusive ofj CpG sites -43 to +53, -43 to +10, -43 to 
-14, +9 to +53 and +1 to +53. 

35. An assay according to claim 34, wherein the region of the GST-Pi gene 
and its regulatory flanking sequences within which the presence of 
methylated cylosine(s) at a site or sites is determined is the region defined by 
(and inclusive of) CpG sites +9 to +53. 

36. An assay according to claim 34, wherein the region of the GST-Pi gene 
and its regulatory flanking sequences within which the presence of 
methylated cytosine(s) at a site or sites is determined is the region defined by 
(and inclusive of) CpG sites +1 to +53. 



37. An assay according to any one of claims 33 to 36, wherein prior to the 
determination step, the isolated DNA is treated such that unmethylated 
cytosines are converted to uracil or another nucleotide capable of forming a 
base pair with adenine while methylated cytosines are unchanged or are 
converted to a nucleotide capable of forming a base pair with guanine. 

38. An assay according to claim 37, wherein the treatment of the isolated 
DNA involves reacting the isolated DNA with bisulphite. 



30 



39. An assay according to any one of claims 33 to 38, wherein the 
determination step involves selective hybridisation of 
oligouucleotide/polynucleotide/peptide-nucleic acid (PNA) probes. 
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40. An assay according to any one of claims 33 to 39, wherein said DNA is 
isolated from cells from tissue, blood (including serum and plasma), semen, 
urine, lymph or bone marrow. 

5 41. An assay according to any one of claims 33 to 40, wherein the disease 
or condition to be assayed is selected from cancers. 

42. An assay according to claim 41, wherein the disease or condition to be 
assayed is selected from prostate cancer, breast cancer, cervical cancer and 

10 liver cancer. 

43. An assay according to claim 42, wherein the disease or condition to be 
assayed is prostate cancer. 

15 44. An assay according to claim 42, wherein the disease or condition to be 
assayed is liver cancer. 

45. A primer or probe comprising a nucleotide sequence selected from the 
group consisting of: 
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25 



20 



CGCGAGGTTTTCGTTGGAGTTTCGTCGTC 

CGTTATTAGTGAGTACGCGCGGTTC 

YGGTT1TAGGGAATTTTTTTTCGC 

YGGYGYGITAGTTYGTTGYGTATATTTC 

GGGAATT1TTTTTCGCGATGTTTYGGCGC 

TTTTTAGGGGGTTYGGAGCGTTTC 

GGTAGGTTGYGTTTATCGC 

AAAAATTCRAATCTCTCCGAATAAACG 

AAAAACCRAAATAAAAACCACACGACG 

TCCCATCCCTCCCCGAAACGCTCCG 

GAAACGCTCCGAACCCCCTAAAAACCGCTAACG 



(SEQ ID NO: 1) 
(SEQ ID NO: 2) 
(SEQ ID NO: 3) 
(SEQ ID NO: 4) 
(SEQ ID NO: 5) 
(SEQ ID NO: 6) 
(SEQ ID NO: 7) 
(SEQ ID NO: 8) 
(SEQ ID NO: 9) 
(SEQ ID NO: 10) 
(SEQ ID NO: 11) 
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CRCCCTAAAATCCCCRAAATCRCCGCG 
ACCCCRACRACCRCTACACCCCRAACGTCG 



(SEQ ID NO: 12) 
(SEQIDNO: 13) 



CTCTTCTAAAAAATCCCRCRAACTCCCGCCG (SEQ ID NO: 14) 



AACTCCCRCCGACCCCAACCCCGACGACCG, (SEQ ID NO: 16), 

wherein Y is a mixture of C and T, and R is a mixture of A and G. 

46. A probe comprising a nucleotide sequence selected from the group 
consisting of: 

Conversion oligonucleotide: 

AAACCTAAAAAATAAACAAACAA (SEQ ID NO: 17) 
GGGCCTAGGGAGTAAACAGACAG (SEQ ID NO: 18) 
CCTTTCCCTCTTTCCCARRTCCCCA (SEQ ID NO: 19) 
TTTGGTATTTTTTTTCGGGTTTTAG (SEQIDNO: 20) 
CTTGGCATCCTCCCCCGGGCTCCAG (SEQID NO: 21) 
GGYAGGGAAGGGAGGYAGGGGYTGGG (SEQIDNO: 22). 



AAAACRCCCTAAAATCCCCGAAATCGCCG 



(SEQIDNO: 15) 
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( Transcription start 




on 1 Exon 2 Exon 3 Exon 4 Exon 5 Exon 




on 7 



5' Flanking Promoter PCR 495-993 

GST-25 -56 

•55 -54 
CGGCCAACATGGTGAAACCCCGTCTCTACTAAAA 
-53 

ATAC A A A A ATC AGCCAG ATGTCGCACGC ACCTAT 
-52 

AATTCCACCTACTCGGGAGGCTGAAGCAGAATTG 
-51 -50 -19 

CTTGAACCCGAGAGGCGGAGGTTGCAGTGAGCCG 

48 -47-46 
CCG AG ATCGCGCC ACTGC ACTCX)AGCCTGGGCCA 

-45 -44 
CAGCGTGAGACTACGTCATAAAATAAAATAAAAT 

AACACAAAATAAAATAAAATAAAATAAAATAAAA 

TAAAATAATAAAATAAAATAAAATAAAATAAAAT 

-43 

AAAATAAAATAAAGCAATTTCCTTTCCTCTAAGCG 

-42 

GCCTCCACCCCTCTCCCCTGCCCTGTGAAGCGGGT 

-41 -40 -39 
GTGC A AGCTCCGGGATCGC AGCGGTCTTAGGG AA 

-38-37 -36 -35-34 -33 

TTTCCCCCCGCGATGTOCCGGCGCGCCAGTTCGCT 

32 -31 -30 

GCGCACACTTCGCTGCGGTCCTCTra 
GST-3 



Exon 7/3'Untranslated PCR 3874-4160 

GST-27 96 



97 

GCCCGGCCCAAGCTCAAGGCCTTCCTGGCCTCCCC 

98 99 
TGAGTACGTG AACCTCCCCATCA ATGGC A ACGGG 

100 

AAACAGTGAGGGTTGCX3GGGACTCTGAGCCKX3AG 

GCAGAGTTTCCCTTCCTTTCTCCAGGACCAATAAA 

ATTTCTAAGAGAGCTACTATGAGCACTGTGTTTCCT 
101 102 103 

GGGACGGGGCrTAGGGGTTCTCAGCCTCGAGGTCG 

TGGGAGGGCAGAG 



Exon 5 PCR 2381-2646 

GST-31 
68 

CCTrCCACGCACATCCTCTTCCCCTCCTCCCAGGCT 

GGGGCTC AC AG AC AG (XCCCTGGTTGGCCC ATCCC 

69 70 
CAGTGACTGTGTGTTGATCAGGCGCCCAGTCACG 
71 

CGGCCTGCTCCCCTCCACCCAACCCCAGGGCTCT 



ATGGGAAGGACCAGCAGGAGGCAGCCCTGGTGG 

72 73 74 
ACATGGTGAATGACGGCGTGGAGGACCTCCG 
GST-32 




Exon 1 PCR 999-1313 

GST-Il -28 



CC AGCTGCGCGGCG ACTCCGGGG ACTCC AGGGCG 
-22 -21 -20 -19 -18 -1 

CCCCTCTGCGGCCGACGCCCGGGGTGCAGCGGCC 
7 -16 -15 -14 -13-12 

GCCGGGGCTGGGGCCGGCGGGAGTCCGCGGGACC 

-11 -10 -9 -8 
CTCCAGAAGAGCGGCCGGCGCCGTGACTCAGCAC 

-7 -6 -5 
TGGGGCGGAGCGGGGCGGGACCACCXrrTATAAGG 
-4 -3-2 -1 12 

CTX^GGAG<XXGCGAGGCCrTCGCTGGAGTTTCGCC 

3 4 5 6 7 

GCCGCAGTCTrCGCCACCAGTGAGTACGCXjCGGCC 
8 9 10 
CGCGTCCCCGGGGA' 
GST-12 




Exon 2/Exon 3 PCR 1318-1920 

GST-14 13 I 

m^mmmS^m^CGGCAGGGCTCCTC 
4 15 16 17 

GCCCACCTCGAGACCCGGGACGGGGGCCTAGGG 

18 19 20 

G ACCC AGG ACGTCCCC A GTGCCGTT AG CGG CTTT 

21 22 23 
CAGGGGGCCCGGAGCGCCTCGGGGAGGGATGGG 

24 25 26 
ACCCCGGGGGCGGGGAGGCKjGOGCAGGCTGCGC 

2728 29 
TCACOXXKXriTGGCATCCrCCCCCGGGCTCCAG 

30 31 
CAAACTTTTCI 1 1 G TTCGCTGCAGTGCCGCCCTA 

32 33 
CACCGTGGTCTATTTCCCAGTTCGAGGTAGGAGC 

ATGTGTCTGGCAGGGAAGGGAGGCAGGGGCTGG 

34 35 
CK3CTGCAGCCCACAGCCCCTCGCCCACCCGGAGA 

36 37 38 

GATCCGAACCCCCTTATCCCTCCGT(X3TGTGGCTT 

39 40 
TTACXXCXKjGCCKXrnCCTrGTTCCCCCKXTCTCC 
41 42 

CG(XATGCCTGCTCCCCGCCCCAGTGTTGTGTGAA 
43 

ATCTTCGG AGG AACCTGTTT ACCTGTTCCCTCCCT 
44 45 46 

GCACTCCTGACCCCTCCCCGGGTTGCTGCGAGGCG 

47 48 49 
GAGT(X}GCCCGGTCCCCACATCrCGTACTTCTCCC 

50 51 52 53 54 

TCCCCGCA( 

GST-15 
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Figure 10 
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Sequence Listings: 

Applicant: Commonwealth Scientific and Industrial Research 
Organisation 

Title: Diagnostic assay 

Prior Application Number: PP3129 

Prior Application Filing Date: 1998-04-23 

Number of SEQ ID NOs : 59 

Software: Patentln Ver. 2.1. 

SEQ ID NO: 1 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
Sequence: 1 

cgcgaggttt tcgttggagt ttcgtcgtc 29 

SEQ ID NO: 2 
Length: 25 
Type : DNA 

Organism: Homo sapiens 
Sequence: 2 

cgttattagt gagtacgege ggttc 25 

SEQ ID NO: 3 
Length: 24 
Type: DNA 

Organism: Homo sapiens 
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Sequence: 3 

yggttttagg gaattttttt tcgc 

SEQ ID NO: 4 
Length: 28 
Type: DNA 

Organism: Homo sapiens 
Sequence: 4 

yggygygtta gttygttgyg tatatttc 

SEQ ID NO: 5 
Length: 29 
Type: DNA 

Organism: Homo sapiens 
Sequence: 5 

gggaattttt tttcgcgatg tttyggcgc 

SEQ ID NO: 6 
Length: 24 
Type: DNA 

Organism: Homo sapiens 
Sequence: 6 

tttttagggg gttyggagcg tttc 

SEQ ID NO: 7 
Length: 19 
Type: DNA 

Organism: Homo sapiens 

Sequence: 7 
ggtaggttgy gtttatcgc 

SEQ ID NO: 8 
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Length: 27 
Type: DNA 

Organism: Homo sapiens 
Sequence: 8 

aaaaattcra atctctccga ataaacg 27 

SEQ ID NO: 9 
Length: 27 
Type : DNA 

Organism: Homo sapiens 
Sequence: 9 

aaaaaccraa ataaaaacca cacgacg 27 

SEQ ID NO: 10 
Length: 25 
Type: DNA 

Organism: Homo sapiens 
Sequence: 10 

tcccatccct ccccgaaacg ctccg 25 

SEQ ID NO: 11 
Length: 33 
Type : DNA 

Organism: Homo sapiens 
Sequence: 11 

gaaacgctcc gaacccccta aaaaccgcta acg 33 

SEQ ID NO: 12 
Length: 27 
Type: DNA 

Organism: Homo sapiens 
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Sequence: 12 

crccctaaaa tccccraaat crccgcg 27 

SEQ ID NO: 13 
Length: 30 
Type: DNA 

Organism: Homo sapiens 
Sequence: 13 

accccracra ccrctacacc ccraacgtcg 30 

SEQ ID NO: 14 
Length: 31 
Type: DNA 

Organism: Homo sapiens 
Sequence: 14 

ctcttctaaa aaatcccrcr aactcccgcc g 31 

SEQ ID NO: 15 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
Sequence: 15 

aaaacrccct aaaatccccg aaatcgccg 29 

SEQ ID NO: 16 
Length: 30 
Type : DNA 

Organism: Homo sapiens 
Sequence: 16 

aactcccrcc gaccccaacc ccgacgaccg 30 
SEQ ID NO: 17 
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Length: 23 
Type: DNA 

Organism: Artificial Sequence 
Feature: 

Other Information: Description of Artificial 
Sequence : Oligonucleotide 

which binds bisulf ite-converted human GST-Pi gene 

Sequence: 17 

aaacctaaaa aataaacaaa caa 23 

SEQ ID NO: 18 
Length: 23 
Type: DNA 

Organism: Artificial Sequence 
Feature: 

Other Information: Description of Artificial 
Sequence : Oligonucleotide 

which binds non-converted human GST-Pi gene 

Sequence: 18 

gggcctaggg agtaaacaga cag 23 

SEQ ID NO: 19 
Length: 25 
Type : DNA 

Organism: Artificial Sequence 
Feature: 

Other Information: Description of Artificial 
Sequence : Oligonucleotide 

which binds human GST-Pi gene 

Sequence: 19 
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cctttccctc tttcccarrt cccca 

SEQ ID NO: 20 
Length: 25 
Type: DNA 

Organism: Artificial Sequence 
Feature: 

Other Information: Description of Artificial 
Sequence : Oligonucleotide 

which binds bisulf ite-converted human GST-Pi gene 

Sequence: 20 

tttggtattt tttttcgggt tttag 

SEQ ID NO: 21 
Length: 25 
Type: DNA 

Organism: Artificial Sequence 
Feature: 

Other Information: Description of Artificial 
Sequence : Oligonucleotide 

which binds non-converted human GST-Pi gene 

Sequence: 21 

cttggcatcc tcccccgggc tccag 

SEQ ID NO: 22 
Length: 26 
Type : DNA 

Organism: Artificial Sequence 
Feature: 
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Other Information: Description of Artificial 
Sequence : Oligonucleotide 

which binds human GST-Pi gene 

Sequence: 22 

ggyagggaag ggaggyaggg gytggg 

SEQ ID NO: 23 
Length: 31 
Type : DNA 

Organism: Homo sapiens 
Sequence: 23 

ttatgtaata aatttgtata ttttgtatat g 

SEQ ID NO: 24 
Length: 25 
Type : DNA 

Organism: Homo sapiens 
Sequence: 24 

tgtagattat ttaaggttag gagtt 

SEQ ID NO: 25 
Length: 27 
Type : DNA 

Organism: Homo sapiens 
Sequence: 25 

aaacctaaaa aataaacaaa caacaaa 

SEQ ID NO: 26 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
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Sequence: 26 

aaaaaacctt tccctctttc ccaaatccc 29 



SEQ ID NO: 27 
Length: 27 
Type : DNA 

Organism: Homo sapiens 
Sequence: 27 

tttgttgttt gtttattttt taggttt 27 

SEQ ID NO: 28 
Length: 2 6 
Type : DNA 

Organism: Homo sapiens 
Sequence: 28 

gggatttggg aaagagggaa aggttt 26 

SEQ ID NO: 29 
Length: 24 
Type: DNA 

Organism: Homo sapiens 
Sequence: 29 

actaaaaact ctaaacccca tccc 24 

SEQ ID NO: 30 
Length: 24 
Type : DNA 

Organism: Homo sapiens 
Sequence: 30 

aacctaatac taccttaacc ccat ?4 
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SEQ ID NO: 31 
Length: 33 
Type: DNA 

Organism: Homo sapiens 
Sequence: 31 

aatcctcttc ctactatcta tttactccct aaa 33 

SEQ ID NO: 32 
Length: 29 
Type: DNA 

Organism: Homo sapiens 
Sequence: 32 

aaaacctaaa aaaaaaaaaa aaacttccc 29 

SEQ ID NO: 33 
Length: 29 
Type: DNA 

Organism: Homo sapiens 
Sequence : 33 

ttggttttat gttgggagtt ttgagtttt 29 

SEQ ID NO: 34 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
Sequence: 34 

ttttgtgggg agttggggtt tgatgtt.gt 29 

SEQ ID NO: 35 
Length: 29 
Type: DNA 

Organism: Homo sapiens 
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Sequence: 35 

ggtttagagt ttttagtatg gggttaatt 29 

SEQ ID NO: 36 
Length: 20 
Type: DNA 

Organism: Homo sapiens 
Sequence: 36 

tagtattagg ttagggtttt 20 

SEQ ID NO: 37 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
Sequence: 37 

aactctaacc ctaatctacc aacaacata 29 

SEQ ID NO: 38 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
Sequence: 38 

caaaaaactt taaataaacc ctcctacca 29 

SEQ ID NO: 39 
Length: 32 
Type : DNA 

Organism: Homo sapiens 
Sequence: 39 

gttttgtggt taggttgttt tttaggtgtt ag 32 
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SEQ ID NO: 40 
Length: 3 0 
Type: DNA 

Organism: Homo sapiens 
Sequence: 40 

gttttgagta tttgttgtgt ggtagttttt 30 



SEQ ID NO: 41 
Length: 30 
Type: DNA 

Organism: Homo sapiens 



Sequence: 41 

ttaatataaa taaaaaaaat atatttacaa 

SEQ ID NO: 42 
Length: 34 
Type : DNA 

Organism: Homo sapiens 



30 



Sequence: 42 

caacccccaa tacccaaccc taatacaaat actc 34 

SEQ ID NO: 43 
Length: 26 
Type : DNA 

Organism: Homo sapiens 
Sequence: 43 

ggttttagtt tttggttgtt tggatg 26 

SEQ ID NO: 44 
Length: 26 
Type: DNA 
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Organism: Homo sapiens 
Sequence: 44 

tttttttgtt tttagtatat gtgggg 26 



SEQ ID NO: 45 
Length: 30 
Type: DNA 

Organism: Homo sapiens 
Sequence: 45 

atactaaaaa aactattttc taatcctcta 



30 



SEQ ID NO: 46 
Length: 29 
Type : DNA 

Organism: Homo sapiens 
Sequence : 46 

ccaaactaaa aactccaaaa aaccactaa 29 

SEQ ID NO: 47 
Length: 38 
Type : DNA 

Organism: Artificial Sequence 
Feature: 

Other Information: Description of Artificial Sequence: M13-human 
GST-Pi oligonucleotide 

Sequence: 47 

tgtaaaacga cggccagtgg gatttgggaa agagggaa 38 

SEQ ID NO: 4 8 
Length: 38 
Type : DNA 
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Organism: Artificial Sequence 
Feature : 

Other Information: Description of Artificial Sequence: M13-human 
GST-Pi oligonucleotide 

Sequence: 48 

tgtaaaacga cggccagttg ttgggagttt tgagtttt 38 

SEQ ID NO: 4 9 
Length: 31 
Type: DNA 

Organism: Artificial Sequence 
Feature : 

Other Information: Description of Artificial Sequence: M13-human 
GST-Pi oligonucleotide 



Sequence: 49 

tgtaaaacga cggccagtta gtattaggtt a 

SEQ ID NO: 50 
Length: 37 
Type: DNA 

Organism: Artificial Sequence 



31 



Feature: 

Other Information: Description of Artificial Sequence: M13-human 
GST-Pi oligonucleotide 

Sequence: 50 

tgtaaaacga cggccagtgt tttgagtatt tgttgtg 37 

SEQ ID NO: 51 
Length: 35 
Type: DNA 
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Organism: Artificial Sequence 
Feature : 

Other Information: Description of Artificial Sequence: M13-human 
GST-Pi oligonucleotide 



Sequence: 51 

tgtaaaacga cggccagtgt ttttagtata tgtgg 35 

SEQ ID NO: 52 
Length: 499 
Type : DNA 

Organism: Homo sapiens 



Sequence: 52 
tgcagatcac ctaaggtcag 
tctactaaaa atacaaaaat 
gaggctgaag cagaattgct 
gcgccactgc actccagcct 
aacacaaaat aaaataaaat 
aataaaataa aataaaataa 
ctgccctgtg aagcgggtgt 
cgcgatgtcc cggcgcgcca 
ctgtttactc cctaggccc 



gagttcgaga ccagcccggc 
cagccagatg tggcacgcac 
tgaacccgag aggcggaggt 
gggccacagc gtgagactac 
aaaataaaat aaaataaaat 
agcaatttcc tttcctctaa 
gcaagctccg ggatcgcagc 
gttcgctgcg cacacttcgc 



caacatggtg aaaccccgtc 60 
ctataattcc acctactcgg 120 
tgcagtgagc cgccgagatc 180 
gtcataaaat aaaataaaat 240 
aataaaataa aataaaataa 300 
gcggcctcca cccctctccc 360 
ggtcttaggg aatttccccc 420 
tgcggtcctc ttcctgctgt 480 

499 



SEQ ID NO: 53 
Length: 316 
Type: DNA 

Organism: Homo sapiens 



Sequence: 53 

gggacctggg aaagagggaa aggcttcccc 
agggcgcccc tctgcggccg acgcccgggg 
gagtccgcgg gaccctccag aagagcggcc 
gggcgggacc acccttataa ggctcggagg 
gcagtcttcg ccaccagtga gtacgcgcgg 



ggccagctgc gcggcgactc cggggactcc 60 
tgcagcggcc gccggggctg gggccggcgg 120 
ggcgccgtga ctcagcactg gggcggagcg 180 
ccgcgaggcc ttcgctggag tttcgccgcc 240 
cccgcgtccc cggggatggg gctcagagct 300 
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cccagcatgg ggccaa 

SEQ ID NO: 54 
Length: 603 
Type : DNA 

Organism: Homo sapiens 
Sequence: 54 

cagcatcagg cccgggctcc cggcagggct 
cctaggggac ccaggacgtc cccagtgccg 
cggggaggga tgggaccccg ggggcgggga 
gcatcctccc ccgggctcca gcaaactttt 
tggtctattt cccagttcga ggtaggagca 
ggggctgcag cccacagccc ctcgcccacc 
tcgtgtggct tttaccccgg gcctccttcc 
cccgccccag tgttgtgtga aatcttcgga 
tcctgacccc tccccgggtt gctgcgaggc 
tctccctccc cgcaggccgc tgcgcggccc 
get 
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316 



cctcgcccac ctcgagaccc gggacggggg 60 
ttageggett teagggggee cggagcgcct 120 
gggggggcag gctgcgctca ccgcgccttg 180 
ctttgttcgc tgcagtgccg ccctacaccg 240 
tgtgtctggc agggaaggga ggcaggggct 300 
eggagagate cgaaccccct tatccctccg 360 
tgttccccgc ctctcccgcc atgcctgctc 420 
ggaacctgtt tacctgttcc ctccctgcac 480 
ggagteggee cggtccccac atetegtact 540 
tgegcatget gctggcagat cagggecaga 600 

603 



SEQ ID NO: 55 
Length: 266 
Type : DNA 

Organism: Homo sapiens 
Sequence: 55 

gctctgagca cctgctgtgt ggcagtctct 
tcccaggctg gggctcacag acagccccct 
atcaggcgcc cagtcacgcg gcctgctccc 
gaccagcagg aggcagccct ggtggacatg 
aaatacatct ccctcatcta caccaa 



catccttcca cgcacatcct cttcccctcc 60 
ggttggccca tccccagtga ctgtgtgttg 120 
ctccacccaa ccccagggct ctatgggaag 180 
gtgaatgacg gcgtggagga cctccgctgc 240 

266 



SEQ ID NO: 56 
Length: 2 87 
Type : DNA 

Organism: Homo sapiens 
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Sequence: 56 

tccccctgct ctcagcatat gtggggcgcc tcagtgcccg gcccaagctc aaggccttcc 60 

tggcctcccc tgagtacgtg aacctcccca tcaatggcaa cgggaaacag tgagggttgg 120 

ggggactctg agcgggaggc agagtttgcc ttcctttctc caggaccaat aaaatttcta 180 

agagagctac tatgagcact gtgtttcctg ggacggggct taggggttct cagcctcgag 240 

gtcggtggga gggcagagca gaggactaga aaacagctcc tccagca 287 

SEQ ID NO: 57 
Length: 524 
Type : DNA 

Organism: Homo sapiens 



Sequence : 


57 












ataaaataaa 


ataaaataaa 


ataaagcaat 


ttcctttcct 


ctaagcggcc 


tccacccctc 


60 


tcccctgccc 


tgtgaagcgg 


gtgtgcaagc 


tccgggatcg 


cagcggtctt 


agggaatttc 


120 


cccccgcgat 


gtcccggcgc 


gccagttcgc 


tgcgcacact 


tcgctgcggt 


cctcttcctg 


180 


ctgtctgttt 


actccctagg 


ccccgctggg 


gacctgggaa 


agagggaaag 


gcttccccgg 


240 


ccagctgcgc 


ggcgactccg 


gggactccag 


ggcgcccctc 


tgcggccgac 


gcccggggtg 


300 


cagcggccgc 


cggggctggg 


gccggcggga 


gtccgcggga 


ccctccagaa 


gagcggccgg 


360 


cgccgtgact 


cagcactggg 


gcggagcggg 


gcgggaccac 


ccttataagg 


ctcggaggcc 


420 


gcgaggcctt 


cgctggagtt 


tcgccgccgc 


agtcttcgcc 


accagtgagt 


acgcgcggcc 


480 


cgcgtccccg 


gggatggggc 


tcagagctcc 


cagcargggg 


ccaa 




524 



SEQ ID NO: 58 
Length: 524 
Type : DNA 

Organism: Homo sapiens 
Sequence: 58 

ataaaataaa ataaaataaa ataaagtaat tttttttttt ttaagtggtt tttatttttt 60 
ttttttgttt tgtgaagtgg gtgtgtaagt tttgggattg tagtggtttt agggaatttt 120 
tttttgtgat gttttggtgt gttagtttgt tgtgtatatt ttgttgtggt tttttttttg 180 
ttgtttgttt attttttagg ttttgttggg gatttgggaa agagggaaag gtttttttgg 240 
ttagttgtgt ggtgattttg gggattttag ggtgtttttt tgtggttgat gtttggggtg 300 
tagtggttgt tggggttggg gttggtggga gtttgtggga ttttttagaa gagtggttgg 360 
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tgttgtgatt tagtattggg gtggagtggg gtgggattat ttttataagg tttggaggtt 420 
gtgaggtttt tgttggagtt ttgttgttgt agtttttgtt attagtgagt atgtgtggtt 480 
tgtgtttttg gggatggggt ttagagtttt tagtatgggg ttaa 524 

SEQ ID NO: 59 
Length: 524 
Type : DNA 

Organism: Homo sapiens 



Sequence : 


59 












ataaaataaa 


ataaaataaa 


ataaagtaat 


tttttttttt 


ttaagcggtt 


tttatttttt 


60 


ttttttgttt 


tgtgaagcgg 


gtgtgtaagt 


ttcgggatcg 


tagcggtttt 


agggaatttt 


120 


ttttcgcga t 


gtttcggcgc 


gttagttcgt 


tgcgtatatt 


tcgttgcggt 


tttttttttg 


180 


ttgtttgttt 


attttttagg 


tttcgttggg 


gatttgggaa 


agagggaaag 


gttttttcgg 


240 


ttagttgcgc 


ggcgatttcg 


gggattttag 


ggcgtttttt 


tgcggtcgac 


gttcggggtg 


300 


tagcggtcgt 


cggggttggg 


gtcggcggga 


gttcgcggga 


ttttttagaa 


gagcggtcgg 


360 


cgtcgtgatt 


tagtattggg 


gcggagcggg 


gcgggattat 


ttttataagg 


ttcggaggtc 


420 


gcgaggtttt 


cgttggagtt 


tcgtcgtcgt 


agttttcgtt 


attagtgagt 


acgcgcggtt 


480 


cgcgttttcg 


gggatggggt 


ttagagtttt 


tagtatgggg 


ttaa 




524 
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